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Abstract 

We study Markov processes associated with stochastic differential equations, whose non- 
hnearities are gradients of convex functionals. We prove a general result of existence of such 
Markov processes and a priori estimates on the transition probabilities. The main result 
is the following stability property: if the associated invariant measures converge weakly, 
then the Markov processes converge in law. The proofs arc based on the interpretation of 
a Fokkcr-Planck equation as the steepest descent flow of the relative Entropy functional in 
the space of probability measures, endowed with the Wasserstein distance. Applications 
include stochastic partial differential equations and convergence of equilibrium fluctuations 
for a class of random interfaces. 
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1 Introduction and main results 

In the seminal paper [22j, Jordan-Kinderlehrer-Otto have given a remarkable interpretation 
of the solution to a linear Fokker-Planck equation as the steepest descent flow of the relative 
Entropy functional in the space of probability measures, endowed with the Wasserstein distance. 
The book [3] by Ambrosio-Gigli-Savare has provided a general theory of gradient flows in the 
Wasserstein space of probability measures, including linear and non-linear PDE's, in finite and 
infinite dimension. 

In this paper we want to investigate the probabilistic counterpart of such results. The ap- 
proach is analytical and based on techniques from Calculus of Variations and Optimal Transport 
Problems; however several results have important consequences on existence and in particular 
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convergence of Markov processes being reversible with respect to a log-concave probability mea- 
sure. 

Following [3], we interpret the solution (/-ft)t>o of a Fokker-Planck equation with convex 
potential, as a curve in the space of probability measures, solving a suitable differential varia- 
tional inequality. We obtain interesting estimates on fit which have, to our knowledge, no direct 
probabilistic proof, and are very useful in the study of the time-homogeneous Markov process 
{Xt)t>o whose one-time distributions are (/it)t>o- 

1.1 The main results 

We consider a separable Hilbert space H, which could be finite or infinite dimensional, whose 
scalar product and norm will be respectively denoted by (•,•) and || • ||. We denote by ^{H) 
the set of all probability measures on H, endowed with the Borel cr-algebra. 
We consider a probability measure j on H with the following property: 

Assumption 1.1 7 is log-concave, i.e. for all pairs of open sets B, C C H 

log7((l-t)B + tC) > log 7(i3) + t log 7(C) VtG(0,l). (1.1) 

The class of log-concave probability measures includes all measures of the form (here stands 
for Lebesgue measure) 

7 := ^ e~^^^, where V : H = R'' ^Ris convex and Z := [ e"^ dx < +00, (1.2) 

all Gaussian measures, all Gibbs measures on a finite lattice with convex Hamiltonian; see 
Proposition 12.21 and the Appendix for more information on the class of log-concave probability 
measures. 

We denote the support of 7 by = ^(7) and the smallest closed afRne subspace of H 
containing K hy A = ^(7). We write canonically 

A = + h^, h° G K, < ||A;|| y keK, (1.3) 

so that = h^i'j) is the element of minimal norm in K and = H'^i'y) is a closed linear 
subspace of H. As in the Gaussian case, we will say that 7 in non- degenerate if H^{'j) = H. 

We want to consider a stochastic processes with values in ^(7) and reversible with respect 
to 7. We now state a first result which determines such process in a canonical way. We 
denote by Cb{H) the space of bounded continuous functions in H and by C^{A{'-f)) the space 
of all $ : ^(7) I— > M which are bounded, continuous and Prechet differentiable with bounded 
continuous gradient V$ : ^(7) ^ H^il) (notice that all functions in C^{A{"f)) are Lipschitz 
continuous). 

We set n := C{[0,+co[;K) C and we denote by Xt : K the coordinate 

process Xt{u:) := tot, t > 0. We shall endow Q with the Polish topology of uniform conver- 
gence on bounded subsets of [0,+oo[, and the relative Borel a-algebra. On we shall 
consider the canonical cr-algebra generated by cylindrical sets and, for probability measures in 
^[o,+oo[^ the convergence induced by the duality with continuous cylindrical functions of the 
form f{Xt„. ..,XtJ, with / G Cb{K^). 
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Theorem 1.2 (Markov process and Dirichlet form associated to 7) Lei 7 be a log-concave 
probability measure on H and let K be its support. Then: 

(a) The bilinear form £ = £^f^\\.\\ given by 

£{u,v) := j^{Vu,Vv)HO^^~)dj, u, v £ C^{Ai^)), (1.4) 

is closable in ^^(7) and its closure {£, D{£)) is a symmetric Dirichlet Form. Furthermore, 
the associated semigroup {Pt)t>o L'^il) fnaps L°°{'^) in Cb{K). 

(b) There exists a unique Markov family (P^; : x G K) of probability measures on 
associated with £. More precisely, 'Kx[f{Xt)] = Ptf{x) for all bounded Borel functions and 
all X (z K. Moreover, x t-^F^ is continuous. 

(c) For all x e K, F* {C{]0, +oo[; H)) = 1 and E^{\\Xt - xf] Q as t i 0. Moreover, 
P* (C([0, +oo[; H)) = 1 for 7-a.e. xeK. 

(d) {Fx '■ X G K) is reversible with respect to 7, i.e. the transition semigroup {Pt)t>o is 
symmetric in Lp'ij); moreover is invariant for {Pt), i-e. ^{Ptf) = 7(/) for all f G Cf,{K) 
and t >0. 

An example in H := of tiie above setting is provided by (jl.2p when the potential y : M'^ ^ M 
is convex with Lipschitz continuous gradient W : M.^ — > M*^. Then 7 is log-concave, see 
Proposition 12.21 and the process X is a solution of the Stochastic Differential Equation (SDE): 

dXt = -VV{Xt)dt+ V2dW, Xo{x) =x, (1.5) 

where Vl^ is a M'^-valued Brownian motion. One can also consider a convex V G C^'^(C/), where 
U C M*^ is a convex open set, and V = +00 on M*"' \ U. Then X solves the SDE with reflection 
at the boundary dU of U : 

dXt = -VV{Xt) dt+ V2dW + n{Xt) dLt, Xo{x) = x, (1.6) 

where n is an inner normal vector to dU and L is a continuous monotone non-decreasing process 
which increases only when Xt G dU. Equations like (jl.Sp and ()1.6p with convex potentials arise 
in the theory of random interfaces. The invariant measure 7 is typically a Gibbs measure on 
a lattice. Interesting infinite-dimensional examples include Stochastic PDEs with monotone 
gradient non-linearities or with reflection. See subsection 11.21 for an overview of the literature. 

Before stating the next theorem, we define the relative Entropy functional; for all probability 
measures fi on H we set: 

W(/i|7) := / p log pd^ (1.7) 
Jh 

if = /07 for some p G -^^^(7), and -|-oo otherwise. We recall that ^(-17) > by Jensen's 
inequality. 
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We also define the Wasserstein distance: given two probability measures /i, v on we set 



W2(/i, i^) := inf 



\y — (is 



Here r(/Li, v) is the set of all couplings between /i and v: it consists of all probability measures 
T, on H X H whose first and second marginals are respectively and i.e. S(S x H) = n{B) 
and S(F X = z>(S) for all B G We set 



^2{H) := |/i G ^{H) : ^ 



dfi{x) < oo 



It turns out that W2(-,-) is a distance on ^2{H) and that {^2{H),W2) is a complete and 
separable metric space, whose convergence implies weak convergence, see for instance [3l Propo- 
sition 7.1.5]. Then, we have the following result: 

Theorem 1.3 (Estimates on transition probabilities) Let j be a log-concave probability 
measure on H and let (F^) be as in Theorem Fix x G K and denote the law of Xt under 
Px by ff. Then [0,+oo[xK 3 {t,x) ^ vf £ ^2{H) is continuous and 

n{uf\j) < inf / \\y-x\\^da{y) + n{ah)] < +oo Vt > 0, 

so that t'f ^ 7 for all t > 0, x £ K . Moreover, 

W2{vf, vl) < y/2n{u-\-f) V\t-s\, t,s>e,xeK. 

Notice that the estimates given in Theorem 11.31 do not contain any constant depending on H 
or on 7 and appear to be of a structural nature. In the particular case 7 G 0^2{H) we have: 
n{vf\-t) < l^wi{5^,^) < +oo,Vt>0. 

We consider now a sequence (7^) of log-concave probability measures on H such that 7^ 
converge weakly to 7. We denote Kn := K{'^n): '■= ^{in), Hn '■= H^iln), in the notation of 
(|1.3p . We want to consider situations where each Hn is an Hilbert space endowed with a scalar 
product {■,-)h„ and an associated i?-continuous norm || • \\h„ possibly different from the scalar 
product and the norm induced by H. In order to ensure that this family of norms converges (in 
a suitable sense) to the norm of as n — > cx), we will make the following assumption: 

Assumption 1.4 There exists a constant k > 1 such that 

-IMh < \\h\\H„ < 4h\\H V/iGF„, nGN. (1.9) 

K 

Furthermore, denoting by Tin ■ H ^ Hn the orthogonal projections induced by the scalar product 
of H, we have 

lim \\7rn{h)\\H„ = \\h\\H yheH. (1.10) 
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This assumption guarantees in some weak sense that the geometry of Hn converges to the 
geometry of H; the case when ah the scalar products coincide with {■,■)}{, Hn C -ffn.+i and 
UnHn is dense in H is obviously included and will play an important role in the paper. 

Let (P" : X G Kn) (respectively (P^: : x G K)) be the Markov process in [0, +oo[^" associated 
to 7„, (resp. in [0, +oo[^ associated to 7) given by Theorem ll.2i We denote by PI^^ := / P" djuix) 
(resp. P^ := j¥xd'y{x)) the associated stationary measures. 

With an abuse of notation, we say that a sequence of measures (Pn) on C{[a, b]; H) converges 
weakly in C([a, 6]; H^) if, for all m E N and hi, ... , hm G -ff , the process ((X, hi)H-, i = 1, ■ ■ ■ ,m) 
under (Pn) converges weakly in C{[a,b];W^) as n ^ 00. 

In this setting we have the following stability and tightness result: 

Theorem 1.5 (Stability and tightness) Suppose that 7„, — > 7 weakly in H and that the 
norms of Hn satisfy Assumption ]! .4\ Then, for all Xn S Kn such that Xn ^ x £ K in H : 

(a) P^^ Pj; weakly in as n ^ 00; 

(b) for alio < e <T < +00, P^^ weakly in C([e, T];H^); 

(c) for allO<T < +00, P!^^ ^ P^ weakly in C([0, T];Hy,). 

This stability property means that the weak convergence of the invariant measures 7„ and 
a suitable convergence of the norms || • \\h„ to || • \\h imply the convergence in law of the 
associated processes, starting from any initial condition. Notice also statement (b) makes sense. 



because Theorem [L2l^c) gives that our processes have continuous modifications in C(]0, +00 [; H) 
(however, we are able to prove tightness only for the weak topology of H). 

Finally, our approach yields naturally the following 

Theorem 1.6 (Uniqueness in J^2{H) of the invariant measure) Let 7 G J^2{H). If fJ- € 
,^2{H) is an invariant measure of {Pt)t>o, i-e- l^-iPtf) = fJ-if) for all f £ Ch{K) and t > 0, 
then ^ = ^■ 

1.2 Motivations and a survey of the literature 

Existence and uniqueness for stochastic equations like (jl.Sp and ()1.6p in finite dimension are 
classical problems in probability theory, starting from [28] and [31]. In [10] . existence and 
uniqueness of strong solutions are proven for a general convex potential V . The Dirichlet form 
approach is detailed in [16]. 

Natural generalizations of (II. 5p to the infinite dimension are provided by stochastic partial 
differential equations (SPDEs): see Chap. 8 of [13] and [12]. SPDEs with reflection, which 
generalize p.6p . have also been studied: see [24], [33], [34], [11]. Unlike the finite-dimensional 
case, no general result of existence and uniqueness is known, and in fact it is not even clear how 
to define a general notion of solution. 

The main result of this paper is the general stability property of this class of stochastic 
processes, given by Theorem 11.51 if the log-concave invariant measures 7„ converge, then the 
laws P" of the associated stochastic processes also converge. In order to appreciate the strength 
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of this result, notice that convergence of 7^ is a much weaker information than convergence (in 
any sense) of the drift Wn in (jl.Sp . In fact, every approach based either on the SDE or on the 
generator and the Dirichlet form associated with the process, seems bound to give only weaker 
results. 

In the stability result, the limit process is identified by the associated Dirichlet form (jl.4p : 
however, in the general case, we can not write a stochastic equation for the limit, although 
this can be (and has been) done in many interesting situations. Our approach yields existence 
of stochastic processes associated with any Dirichlet form of the gradient type (jl.4p with log- 
concave reference measure: this also seems to be a new result (see [1]). 

Stochastic equations of the form (jl.Sp and (jl.6p are used as models for the random evolution 
of interfaces; in these cases the invariant measure is typically a Gibbs measure on a lattice with 
convex interaction: see [29], [19] and [T7] for the physical background. 

In many interesting cases, the Gibbs measure converges, under a proper rescaling, to a non- 
degenerate Gaussian (or related) measure on some function or distribution space. Convergence 
in law of the associated stationary dynamics to the solution of a stochastic partial differential 
equation is interpreted as convergence of the equilibrium fluctuations of the interface around its 
macroscopic hydrodynamic limit: see [20] and |18j . 

Such convergence results are obtained only in the stationary case and the proofs use very 
particular properties of the model. For instance, the techniques of |18j are based on monotonicity 
properties and can not be applied to many interesting situations. Our Theorem II .51 extends the 
convergence result to more general initial conditions and, being based only on the log-concavity 
of the invariant measures, can be applied to a large class of models. For a different (and weaker) 
approach based on infinite dimensional integration by parts, see [35] and |36| . 

Finally, we notice that log-concave measures are still widely used as models for random 
interfaces: see [27] and references therein. 

1.3 Plan of the paper 

We conclude this introduction with a short description of the plan of the paper: Section 2 
is devoted to the introduction of some basic concepts and terminology, while in Section 3 we 
illustrate the model case when i/ = M'^ and W is smooth, bounded and Lipschitz: here almost 
no technical issue arises and the basic heuristic ideas can be presented much better. In Section 4 
we show the basic convexity properties of the relative Entropy functional needed to build in 
Section 5, by implicit time discretization, a "Fokker-Planck" semigroup in the Wasserstein space 
of probability measures. Section 6 is devoted to the quite strong stability properties of this 
semigroup, and these are used in Section 7 to establish, starting from the smooth case, the link 
with Dirichlet forms. Finally, in Section 8 we canonically build our process in Jf[0'+°°[, and 
deduce its continuity properties from the continuity properties of its transition probabilities, 
provided by the Wasserstein semigroup. Finally, we adapt to our case some general results from 
|23j on the existence of Markov processes associated to Dirichlet forms to obtain the results 
stated in Theorem II. 2l fc). 
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2 Notation and preliminary results 



In this section we fix our main notation and recall the main results on Wasserstein distance and 
optimal couplings. 

Throughout the paper we consider a real separable Hilbert space H. For J C H closed we 
denote by Lipf,( J) the space of all bounded 99 : J i-^ M such that: 

MLip(J) := sup I _ : X, y e J, X y j < +00. 

Measure-theoretic notation. If is a separable Hilbert space, we shall denote by ^{H) 
the Borel cr-algebra of H, and by ^{H) the set of (Borel) probability measures in H. Given a 
Borel map r : H ^ H, the push forward r#fJ- S S^{H) of ^ G ,0^{H) is defined by r^ii{B) := 
H{r-^{B)) for all B G m{H). 

The set of non-degenerate Gaussian measures on which all belong to 0^2{H), will be 
denoted by G{H). Analogously, we shall denote by I^g{H) the c-ideal of Gaussian null sets, 
i.e. the sets B G J^{H) such that fJ,{B) = for all fi G G{H). Lebesgue measure in R*^ will be 
denoted by . 

Wasserstein distance, optimal couplings and maps. We have already defined the class of 
couplings between two probability measures /U and v on H and the Wasserstein distance W2{fi, y)'- 
see (II. 8p . Existence of a minimizing S in (jl.Sp is a simple consequence of the tightness of r(/x, z/); 
the class of optimal couplings will be denoted by ro(^, u): 

ro(^f,i/) := |sGr(^f,i/) : / \\y-xf(n: = Wi{p,v)\. (2.1) 
I JHxH ) 

In the special case when // vanishes on all Gaussian null sets (that corresponds to absolute 
continuity with respect to Lebesgue measure in finite dimensions) it has been proved in Theo- 
rem 6.2.10 of that there exists a unique optimal coupling S, and it is induced by an optimal 
transport map t, namely S = (i x t)#/i (the proof is based on the fact that the non-Gateaux 
differentiability set of a Lipschitz function in H is Gaussian null, see e.g. Theorem 5.11.1 in 
[6]). We shall denote this optimal transport map by fj^. This is one of the infinite-dimensional 
generalizations (see also [E] for another result in Wiener spaces) of the finite-dimensional result 
ensuring that whenever fj, G ^^2(1^^^) is absolutely continuous with respect to then there 
exists a unique optimal transport map that is also the gradient of a convex function. 

When we have a sequence (7^) C ^2{H) as in Assumption [LJl we can introduce Wasserstein 
distances in ^2(^n) using two different scalar products: {•,-)h and (•,•)//„. The Wasserstein 
distance with respect to the former one is indicated in the standard way VF2(-,-), while we 
introduce the notation: 

wIhM^)-=^^^\[ \\y-x\\lyj:: j:er{^i,u)\ , ^,i/G^2(An); (2.2) 

notice that if x, y £ An then x — y £ Hn, so that ||x — y\\H„ makes sense. If /i, u are supported 
in An we also denote the class of optimal couplings in T{^, u) with respect to the -ff„-distance 
by rjy„^o(/x, I/). By (II. 9p the two distances are equivalent. 
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Convergence of measures. We will use two notions convergence of measures: first the weak 
convergence in 0^{H), induced by the duality with Cb{H); second, the convergence in 3^2{H) 
induced by the Wasserstein distance. The two definitions are related by the following result (see 
[3], Theorem 5.1.13 and Remark 7.1.11): 

Lemma 2.1 // C l3^2{H), then Hn ^ fJ- in ^2{H) if and only if fin ^ IJ' weakly and 



lim / 



x\\^ dfXn = I W^W^ dfi. (2.3) 

H J H 



Notice that, for weakly converging sequences (/u„), the convergence of the second moments (|2.3I 
is easly seen to be equivalent to 



Rtoo n^oo J{\\x\\>R} 

We recall that weak convergence of fin to fi implies 



lim lim sup / = 0. (2-4) 

n^oo J\\\x\\>R\ 



n— >oo 



liminf / f dfin > / f dfi, (2.5) 
'H Jh 



for every lower semicontinuous function f : H ^ {—oo, +oo] bounded from below. We shall also 
often use the following extension, involving integration with respect to a variable function: if 
are uniformly bounded from below and equi-continuous, we have 

liminf / fndfin^ / liminf /„ d/i. (2-6) 

n— »oo J jj^ J n^oo 



The proof immediately follows by (j2.5p , with the monotone approximation with the continuous 
functions gk = inf /„. 

n>k 

Log-concave probability measures and Entropy. The concept of log-concavity has been 
introduced in Assumption I l.li Since this concept is crucial in this paper, we recall the following 
result. 

Proposition 2.2 ((T], [3], Theorem 9.4.11) Let H = R''. Then 7 G ^(H) is log-concave if 
and only if it admits the following representation: 

j{B)= [ e-^dJT'^ VSg=^(M'=), (2.7) 

J Bn{V<+oo} 

where V ^ (—00, +00] is a suitable convex and lower semicontinuous function, d> is the 
dimension 0/^(7), and Jif^ is the d-dimensional Hausdorff measure. 

If the dimension of H is infinite, then 7 G ^{H) is log-concave if and only if all the finite 
dimensional projections of 7 are log- concave and therefore admit the representation (|2.7p for 
some V and d. 
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If 7 is log-concave, the relative Entropy functional ^(-17) ()1.7p enjoys a crucial convexity prop- 
erty in terms of Wasserstein distance, which has been discovered by McCann in [25] and further 
extended to the infinite dimensional case in [3]. 

Proposition 2.3 (Displacement convexity of the relative Entropy) Let 7 G J!P{H) be 
log-concave and let /jP, G ^2{H) with finite relative entropy. Then there exists an optimal 
coupling S E To[fi^,fi^) such that the curve in ^3^2{H) 

//* := ((1 -t)7r° + t7r^)^S, {-k' : {x^ ,x^) e H x H ^ e H, i = 0, l) (2.8) 

satisfies 

n{Al) < (1 - tmAl) + iW(/i^|7) Vt G [0, 1]. (2.9) 

When H = M^' is finite dimensional and is absolutely continuous w.r.t. the Lebesgue measure 
then the optimal coupling S = (i x is unique, so that 

= {{l-t)^ + tt)^^l^. (2.10) 

3 From Fokker- Planck equation to Wasserstein gradient flows 

In this section we illustrate the known connections between solutions of the SDE (jl.Sp . solutions 
to Fokker-Planck equations, Dirichlet semigroups and Wasserstein gradient flows in the model 
case when the drift term in the SDE is the bounded gradient W of a smooth function 1/ : M'^ — > 
M satisfying: 

\\W{x)-VV{y)\\<L\\x-y\\ Vx, y G 

for some L > 0. We shall also assume that all derivatives of V are bounded and that 7 = 
exp(— 1/) .if is a log-concave probability measure in M'^. Notice that this implies that V is 
convex, and also (see Appendix EI) that there exist constants A G R and i? > such that 
V{x) >A + B\\x\\ for all x G M''. 

All theories mentioned above have a much larger realm of validity (for instance, much less 
regular drift terms in the SDE (jl.Sp are allowed), but for our purposes it suffices to show con- 
nections and a few a priori estimates in the smooth, bounded, Lipschitz case: more general cases 
will follows thanks to the stability Theorem 11.51 (or its Wasserstein counterpart Theorem 16. ip . 

Let us fix k independent standard Brownian motions {W^ , . . . , W^} on a probability space. 
We consider the M'^-valued Brownian motion iyVt)t>Q, where W = iyV^ , . . . , W^). Since W is 
bounded and Lipschitz continuous, it is well known that, for all a; G M'^, there exists a unique 
solution {Xt{x) : t > 0) of the SDE 

dXt = -VV{Xt)dt+ V2dW, Xo{x) =x. (3.1) 

Notice that {Xt{x) — Xt{y),t > 0) solves almost surely an ordinary differential equation, since 
the stochastic terms dW cancel out; then one easily obtains from the convexity of V that 
1 1— > ||X4(x) — Xt(y)|p is non-increasing in [0,-|-oo[ almost surely. As a consequence, a.s. 

\\Xtix)-Xtiy)f <\\x-yf, Vx,yGM^^>0. (3.2) 
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For all X G M*^, t > and /io G ^(M'=) we set: 

i/f := law of Xtix), fit := / dfiQ{x) £ ,^{R^). (3.3) 



By (j3.2p . the map x i— > i/j? is weakly continuous, and therefore /ij is well defined. Moreover, the 
continuity of the process {Xt{x))t>o yields weak continuity of 1 1— > and t ^ nt- 

It is a trivial consequence of Ito's formula that nt solves the Fokker-Planck equation in the 
sense of distributions in ]0, +cxd[xM'^: 

^^t = A^*t + V-(Vy/it); (3.4) 



this means that 

/ ^dfH= f {A^- {VV,Vv)) d^H V(/.eCr(M^), (3.5) 

and the initial condition at t = is attained in the following weak sense: 

lim [ ^dfit= [ ifdfio, V(/? G C~(]R'=). (3.6) 

Equivalently, p.Sp and (|3.6p can be grouped by saying that for every T > and (p G C^([0, T] x 
M'^) we have 

/ ipTdfiT= / ipdfio+ / (9t93t + A^Jt - (Vy, Vv3j)) d/xtdt. (3.7) 

Proposition 3.1 (Uniqueness and stability of FP solutions) For any fiQ G ^(M.^), the 
Fokker-Planck equation (|3.7p /las a unique solution in the class of weakly continuous maps 
t ^ lit ^ ^(M''). 7/^0 e =^2(1^''), i/ien i/ie unique solution [0,+oo[9 t ^ nt £ ^2{^'') is 
continuous. In addition, fit is stable: fiQ — > ^2{^^) implies fif — > fit in- ^2{^^) for all 

t > 0. 

Proof. We consider first the case /Uq = ^x- it only remains to prove that [0, oo) 3 t ^ £ 
^2{H) is continuous. Taking (j2.4p into account, it is enough to prove that 

sup E[||Xj(x)f ] < C(l + \\xf) < +00, (3.8) 

0<t<T 

sup E[\\Xt{x)\\Hn\xJx)\\>R}] <^{R){^ + \\xf) with u{R) i R ] +00, (3.9) 

0<i<T 

where C and to depend only on T and sup ||V1^||. We apply Ito's formula to ||Xf(x)|| and find 

ft I 

II^J < Ikll +Tsup||V1/|| + (A;- 1) / ds + V2Bt, VtG[0,r], 



that 
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where -B is a standard Brownian motion in R. We define now the process (bt), unique non- 
negative solution of 

ft I 

bt = \\x\\+Tsup\\W\\ + {k-l) —ds + V2Bt, Vt > 0. 

Jo bg 

Then (bt) is a Bessel process of dimension k, i.e. {bt,t > 0) is equal in law to (||6o "^+^2 VFj ||, t > 
0), where /i e with ||/i|| = 1: see [26]. By Ito's formula, \\Xt\\ < bt ioi all t G [0,T], almost 
surely. Then (13. Sp and (13. 9p follow from standard Gaussian estimates. 

Existence of FP solutions, as we have seen, is provided by (|3.3p . Uniqueness can for instance 
be obtained by a classical duality argument: let fij, fit be two weakly continuous solutions of 
(j3.5p . and let a := fi] — fif be their difference, satisfying 

[ ipTdaT= [ [ (dtipt + Aipt-{VV,Vipt))datdt (3.10) 

JIRfe JO JK'= ^ ^ 

for every T > and (p G C^([0, T] x M'^). By a mollification technique, it is not difficult to check 
that ()3.10p holds even for every function G C([0, T] x R^) with dt(p, Vtp and V^^j continuous 
and bounded in [0,T] x M'^. For given ip G C^(M^) we consider the solution (ft of the time 
reversed (adjoint) parabolic equation 

dt^t + Apt-{^V,Vpt) = in(0,^)xR^ ^T = tp- (3.11) 

Standard parabolic regularity theory (it suffices to use the maximum principle [21] and the fact 
that the first and second order spatial derivatives of (p solve an analogous equation) shows that 
(f is sufficiently regular to be used as a test function in (I3.10p : this leads to / ipdar = 0. As 
is arbitrary, we obtain that fi}p = fi'^. 

The representation fj.t = J uf dij,Q{x), given in (13. 3p . and the uniform estimates (j3.8p . (13. 9p 
easily imply the stability property. □ 

Notice that the measure 7 provides a stationary solution of (13. 4p (and it can be actually 
shown that all solutions fit weakly converge to 7 as t —> +00); it is also natural to consider initial 
conditions fiQ = po7 S ^{H) with pQ G Lp'{'^). In this class of initial data, one can consider the 
variational formulation of the FP equation induced by the symmetric Dirichlet form 

£y{p,rf):= [ {Vp,Vrf)dj, p, rf e W^^\R''), (3.12) 
where W^'^{R'') is the weighted Sobolev space 

W^^\R') := |p G l2(7) n W^^liR') : £ ||Vpf ^7 < +cx)| . (3.13) 

Proposition 3.2 For every pQ G L'^{'y): 
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1. there exists a unique p. G W^^^ (]0, +oo[; ^^(7)) such that 



j^{pt,v)LH^)+£j{pt,v) = 0, ^v^W^'^iR''), ^^Pt = Po inL'ij); (3.14) 

if Pmin 1^ Po ^ Pmax; then pt Satisfies the same uniform bounds; 

2- if Po ^ and f pod'y = 1, then pt = Ptl £ ^(H) provides the unique solution of ()3.5p 
starting from pQ = po7)' 

3. if po G Ci,(R^), then, for all t > 0, the function 

Ptpoix) := K{po{Xt{x))) = [ poduf, Vx G M^ (3.15) 



provides a continuous version of pt, i.e. Ptpo{x) = pt{x) for ^-a.e. x; moreover Pt acts 
on Lip(,(]R'^) and 

[Ptp]Lipm<[p]upm i>0, pGLip,(R^); (3.16) 

4- {Pt)t>o has an extension to a symmetric strongly continuous semigroup in Lp'i^). 

Proof. Existence of a unique solution of (I3.14p follows by the well-known theory of variational 
evolution equations, as well as the uniform lower and upper bounds on pt, being 8^ a Dirichlet 
form: this proves point 1. Now, for if G C^(M'^) we can choose r] = 93exp(y) in (|3.14p and 
integrate by parts in space to obtain (j3.5p : this shows that pt is the unique solution of the FP 
equation, as stated in point 2. Continuity of PtPo and (|3.16p follow from (13. 2|) : in order to prove 
that Pt = PtPo 7-a.e., we can reduce by linearity to the case po7 G ^{M.^) and then point 3 
follows from point 2. Point 4 follows from standard -^^(7) estimates for equation (13.14p . □ 

Using the convexity inequality S.y{r],r]) > S.y{p, p) + 2£^{p,rj — p), it is not difficult to show 
that an equivalent formulation of (|3.14p is (this kind of formulation first appeared in [5], in 
connection with nonlinear evolution problems in Banach spaces) 

1 1 \\Pt - riWU,) + l£,{pt,Pt) < 1£,{V,V), Vr? G W^'\R'^). (3.17) 

We are going to show an analogous property of the solutions pt of the Fokker-Planck equation 
(13. 7p . obtained just replacing norm with Wasserstein distance and ^£-y{p, p) with the relative 
Entropy functional ^(^717) with respect to 7. This provides the key connection with the 
variational theory of gradient flows in Wasserstein spaces. To this aim, let us first establish the 
analogue of the convexity inequality for the relative Entropy: 

Lemma 3.3 (Energy inequality) Let p G C^iMf") Pi T4^^'^(M^), with < pmin < P < Pmax < 
+00. Then: 

n{r)-f\^)>n{p-i\^)+ [ {Vp,t-i)d-f, V777G =5^2(M'=), (3.18) 
where t is the optimal transport map between p'j and 777. 
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Proof. We just sketch the proof, referring to [3l Lemma 10.4.4, Lemma 10.4.5] or to [9] for 
more general results and detailed arguments. Let t = V(/>, with (p convex, and u = pexp(—V). 
Defining ijP = p^, p^ = rj-) and /i* as in (j2.8p . taking (j2.9p into account it suffices to bound from 
below ^pp'H(//*|7) at t = with J (Vp, t — i) d'y. Now, a computation based on the area formula 
(that provides an explicit expression for the density of /x* with respect to .Sf'^, see also the proof 
of Lemma lA.ll in the Appendix) gives 

= -/ tiace {V^ (p - I)u{x)dx + {W,t-i)pdj, 

where is the Alexandrov pointwise second derivative of (j) and / is the identity matrix. By 
the convexity of 4> we can bound the matrix of absolutely continuous measures V^c/)^^ from 
above with the distributional derivative of V0 = t (which, in general, is a measure, by the 
convexity of 4>) to obtain 

> -{V ■ {t-i),u) + I {VV,t-i) pdj. 

t=o JR'' 

Finally, we integrate by parts to obtain (j3.18p : although u is not compactly supported, this 
formal integration can be justified by approximation of il with uipR, where ipR E C^(]R'^), 
< V'R < 1, IIVV'rII < 1, Vil T 1 and Vi/jr as R ^ +oo. □ 



dt+ 



dt+ 



Proposition 3.4 For all pQ € J^2{^^) the solution (pt) of the FP equation (j3.4p . characterized 
in Proposition \3.1l satisfies the family of variational evolution inequalities 

—Wiipu v) + n^pth) < 7^(^^|7) (3.19) 
in the sense of distributions in ]0, +oo[, for all v G ^2(1^^^)- 

Proof. First of all, we notice that the variational evolution inequalities p.l9p are stable with 
respect to pointwise convergence in Si^2^^\- indeed, if p^ — > pt for all t, then ^i^2^P^^v) — > 
^VFf (/X(,z/) in the sense of distributions, and the lower semicontinuity of "^(-17) allows to pass 
to the limit as n — > 00. 

Thanks to this remark and to the stability properties of solutions to FP equations, we need 
only to show the property when /xq = /0o7 with po £ C^i^^^ po > pmin > 0. Then, we know from 
Proposition 13.21 that pt = ptj, with pt smooth, pt > Pmin- In addition, writing ut = piexp(— F), 
since uq = poexp(— y) belongs to C^{M.'') as well, standard parabolic regularity theory for 
the FP equation whose drift is bounded, together will all its derivatives, gives dtu, \/u, V^u € 
Cb ([0,r] X M*^) for all T > 0. 

We are interested in getting pointwise bounds for the velocity field Vt := —{'Vpt)/pt', it 
appears in a natural way in this problem because, by ()3.4p . pt solve the classical continuity 
equation 

^Pt + V ■ivtpt) = in]0,+cx)[x]R^ 
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describing the evolution of a time-dependent mass distribution fit under the action of a velocity 
field Vt- Since pt = ut exp(V), and exp(F) is not bounded above, we can not use directly the C| 
bounds on ut- However, we can use the fact that pt solve the linear PDE dtpt = Apt — {Vpt, VT^), 
to obtain 

dtVp = AVp - V VV/9 - V^pVV. 

Taking the scalar product with Vp and using the identity A||Vp|p = 2(AVp, Vp) + 2||V^/9|p, 
we can neglect the nonnegative terms ||V^/9|p and (V^yVp, Vp) to get 

dtWVptf < A\\vptf - (vy, viivptf ). 

The classical maximum principle can now be applied, thanks to the fact that ||Vpf|p grows at 
most exponentially [21^ Page 216] to obtain ||V/9j|p < ||Vpo|P for all t > 0. By the uniform 
lower bound on pt, \\vt\\ is uniformly bounded. 

Now, let us show first that 1 1— > is a Lipschitz map in [0, +00 [ with values in ^2{^'')- Let 
< ti < i2 < +00; the smoothness and the just proved boundedness of Vt ensure the existence 
of a unique flow Y{t,x) : [0, -|-oo[xIR'^ R'' associated to Vt, i.e. Y{0,x) = x and ^Y{t,x) = 
Vt{Y{t,x)) in [0,-|-oo[. Then, the method of characteristics (see e.g. O Proposition 8.1.8]) for 
solutions to the continuity equation shows that pt is given by Y{t,-)^pQ for all t S [0,-|-cxd[. 
Therefore we can use the coupling {Y{t2, ■),Y{ti, ■))#po to estimate ^2(^^*1,1^*2) ^ follows: 



Wi{pt„Pt,)< [ \\Y{t2,x)-Y{ti,x)fdpo= [ II / 



t2 

vt{Y{t, x)) (itf dpQ < C{t2 - tif. 



ti 

This proves that t ^ pt Lipschitz in [0, +00 [. 

To conclude the proof, it suffices to check (|3.19|) at any differentiability point t £ [0, -|-oo[ of 
the map t ^ W2{pt, v)- Let t be the optimal transport map between pi and v, and let h > 
since S = {Yit + h, ■),t o Y(t, •))^po is a coupling between pt+h ™d u, we can estimate (using 
the identity ||a|p - ||6|p = {a + b,a- b)): 

Wi{pt+j,,u)-Wiipt,,^) ^ 1 f , , ^,,,2^,. 1 



h 

1 
h 



<\ f \\t{Y{i,x))-Y{i+h,x)fdpo-\f \\t{y)-yfdpt 



t{Y{i,x))-Y{t + h,x)r -\\-tiy{t,x)) -Y{t,x)\\' dpo 
{2t{Y{t, x)) - Y{t + h,x)- Y{t, x), ]- {Y{t + h,x)- Y{t, x))) dpo 
{t{Y{t, x)) - Y{t, x), Vt{Y(t, x))) dpo = -2 [ {t{y) - y, Vt{y)) dpi, 

as /i I 0, by dominated convergence. Prom the energy inequality p.lSp we obtain (j3.19p . □ 

Starting from (j3.4p . we have derived a new relation (j3.19p satisfied by solutions to FP equa- 
tions, at least when H = M}' and V is smooth, with \/V bounded and Lipschitz. The idea 
of [3] is to consider (j3.19p as the definition of a differential equation in a space of probability 
measures endowed with the Wasserstein distance even when either V is less regular or H is 



14 



infinite-dimensional: more precisely, {nt)t>o is said to solve the gradient flow of the functional 
^(•17); the choice of 5x as initial measure is the one that provides the link with the laws vf of 
the solution Xt{x) of the SDE dXT]) . 

Notice that (I3.19P is defined only in terms of the Wasserstein distance and the relative en- 
tropy, namely of objects which make perfectly sense on an arbitrary Hilbert space H. Motivated 
by Proposition 13.41 we set the following: 

Definition 3.5 (Gradient flows) 

Let F : ^2{H) ^ [0, -|-oo] and set D(F) = {F < -|-oo}. We say that a continuous map 
Ht :]0,+oo[i-^ 3^2{H) is a gradient flow of F if 

— -VF|(;U(, v) + F{fit) < Fi'^) the sense of distributions in ]0, -|-oo[, for all v G D{F). 

(3.20) 

We say that fit starts from fiQ if fit fio ^2{H) as t 10. 

The terminology "gradient flow" can be justified, by appealing to Otto's formal differential 
calculus on £p2{^^)] since this calculus will not play a significant role in our paper we will 
not discuss this issue, and refer to [22], [3], [32] for much more heuristics on this subject. 
Here we just point out that existence of gradient flows can be obtained (see Section [5]) by the 
analogue in the Wasserstein setting of the Euler scheme for the approximation of gradient flows 
x{t) = —'VF{x{t)): namely, given a time step r > 0, we build a sequence (xk) by minimizing 

^\\y - ^kf + ^iy) 

It 

recursively (i.e., given x^, we choose x^+i among the minimizers of the variational problem 
above). Looking at the discrete Euler equation, (x^+i — Xk)/T = —'VF{xk+i), it is clear that 
Xk ~ x{kT). 

Notice also that (j3.20p implies that fit = fJ-o € ^2{H) is a constant gradient flow if and only 
if fiQ minimizes F; in the case F{fi) = 7i{fi\'y), since t i— > tint is strictly convex, the unique 
minimizer of is = 7. So, from the gradient flow viewpoint, we easily see that the unique 
invariant measure in I^2{H) is 7 if 7 G ^2iH): see Theorem 11.61 and its proof in section [HI 

In the next sections we are going to adopt the "gradient flow" point of view, and prove that 
the results of Theorems II. 2^ 11.31 and 11.51 are relatively easy consequences of this approach. 

4 Implicit Euler scheme 

In this section we construct a discrete approximation of the gradient flow. Such construction 
is based on the following convexity property, which is a stronger version of the one given in 
Proposition 12.31 

Definition 4.1 We say that F : l!p2{H) [0,-|-oo] is strongly displacement convex if for any 
fl,i'Q,L'i £ ,'^2iH) there exists a continuous curve u : [0,1] ^ ii^2{H) such that v\t=i = ^i; 
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i = 0, 1, such that: 



Wiiut,fl) < {l-t)Wiiuo,fl) + tWiii^i,fi) 
F{vt) < il-t)F{i^o) + tF{i^i) 



t{l-t)W^{uo,i^i) 



VtG[0,l]. (4.1) 



In this paper we consider only the case F{iJ,) := 7i{n\^), where 7 is a log-concave probability 
measure on H. In fact the following results are true for much more general classes of strongly 
displacement convex functionals F : 3^2{H) (—00, +00] with essentially the same proof (under 
suitable lower semicontinuity and coercivity assumptions). However at one point we shall take 
advantage of a particular feature of the relative entropy functional 7i{-\j), namely the entropy 
inequality (|4.4|) . in order to simplify the proof. For more general cases, see Chapter 4 in [3]. 

In order to build gradient flows, we use an implicit Euler scheme, at least when ft £ F){F); 
then, suitable Cauchy-type estimates provide existence up to initial data in D{F), as in the 
Hilbertian theory. The scheme can be described as follows: given a time step r > 0, we define a 
"discrete" solution /i^ setting jj,^ = jl and, given we choose i^t^'^^ as the unique minimizer of 



The fact that this is possible is a consequence of the following 



(4.2) 



Proposition 4.2 Let r > 0. For all /i G 0^2{H) there exists a unique fir £ ^2{H) such that: 

\/ve^2{H). (4.3) 



W(^r|7) + ^^l(/"r,/^) < 'n{^\l) + ^Wi{v,^i) 



The existence part of this result is rather standard and relies on tightness and lower semicon- 
tinuity arguments. The uniqueness statement is based on the strong displacement convexity of 
the relative Entropy functional, proved in Proposition 14.31 

Proof. Existence. Let Vk = fkl £ ^2{H) be such that: 



lim 

fc— >oo 



F{uk) + i^Wi{uk.h^) 

IT 



inf 



F{y)^^Wl{v,^C) \ < +00. 



In particular we have that (z^^) is bounded in ^2{H) and limsup^. 'H(i^fc|7) < 00. By using first 
the inequality tint > —e~^ and then Jensen inequality we get 



-j{H\E)+n{ukh)> f fkln fkdj >iyk{E) In y E e m{H), 



(4.4) 



so that 'j{E) implies sup^, f/t (E') 0. It follows that (z/^) is tight in H, so that we can 
extract a subsequence, that we can still denote by (i^k), converging weakly to some /Xr G ^^2{H). 

If we prove that both F and VFf (•, //) are lower semicontinuous with respect to weak conver- 
gence, then we have that fir realizes the minimum in (|4.3p : indeed by lower semicontinuity: 



F{l^r) + ^W^{fir,fi) < liminf 

AT k^oo 



+ ^wi{uk,fi) 



v&.9>2{H) I Zt 
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A nice representation of the relative entropy functional is provided by the duality formula (see 
for instance Lemma 9.4.4 of [3]): 

nin\j) = sup!^j Sdfi- J {e^-l)dj: 5GCb(i?)|. (4.5) 

This formula immediately implies that "^(-17) is sequentially lower semicontinuous with respect 
to the weak convergence. 

Let now G ro(i^fc,/i) and assume with no loss of generality that W2{i^kifJ') converges to 
some limit; since (uk) is tight in H, (S^) is tight in H xH and we can assume that (S^) converges 
weakly to S G ,'^{H x H). Obviously S G T{iy,^) and the weak convergence of S„ gives 

/ ||x — y|p < liminf / — (4.6) 

JhxH JhxH 

Bounding (/^^i ^) fi^om above with J \\x — we obtain the lower semicontinuity property. 

A similar argument also proves the joint lower semicontinuity of (/i, v) i-^ M^2(/^i i^) (we will use 
this fact at the end of the proof of Proposition [ 



Uniqueness. Suppose that fir / Mr realize the minimum in (j4.3p . denoted by in. Let vt be 
a curve between jlr and fir given by Proposition 14.31 below, with the choice Jl = ji. Then we 
obtain: 

1 1 

^(i^l/2) + ^W^I(j^l/2,/^) <m - - W^{flr,flr) < m, 

which is a contradiction. □ 



Proposition 4.3 Let 7 G 3^{H) he log-concave. Then the functional 'H{-\^) : J^2{H) — > 
[0, +cxo] is strongly displacement convex. 

Proof. As shown in [3], it is often enough to build the interpolating curves only for a dense 
subset & of measures fi. In the case of the relative entropy functional (but also for more general 
classes of functionals, see [3]) the set & is made by finite convex combinations of non-degenerate 
Gaussian measures; as & is easily seen to contain finite convex combinations of Dirac masses, & 
is dense in ^2{H). Moreover, any measure in & vanishes on the class ^g{H) of Gaussian null 
sets; hence, for any fi £ ^ and vq, £ ^2{H) we can find optimal transport maps between 
and Ui, i = 0,1. We define 

iyt:= {{l-t)ro + tri)^fi. (4.7) 
Let us first check the Lipschitz continuity of t ^ ft G ^2{H): for s, t £ [0, 1], the coupling 

Est := ((1 - s)ro + sri, (1 - t)ro + tri)^ 

belongs to T{i's,i't), so that 

Wi{vs,vt) < / \\x - yf dJ^st = \t - s\'^ \\ri-rofdfl 

JhxH JH 

< 2\t-smwi{ui,fl) + Wi{iJo,fl)). 
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Now, let us check the convexity of t 



(l-t)(ro 



i) + t{ri - i 



H 



< 



t) 



H 



dfi + t \\ri — iW"^ dfi — t{l — t) / ||ro — rill^ d// 



H 



In the last inequality we used the fact that (ro, ■ri)#/i G r(fO) ^i)- For the convexity of 1 1-^ P{^t), 
achieved through a finite-dimensional approximation, we refer to [3l Theorem 9.4.11]. 

Having built the interpolating curves when € in the general case, we can approximate 
any £ l3^2[H) by measures /i" G S>] notice that the interpolating curves t i— > between uq 
and ui are equi-Lipschitz and, for t fixed, the same tightness argument used in the existence 
part of Proposition 14.21 shows that (ff) is tight. Therefore, thanks to a diagonal argument, we 
can assume that — > ut weakly for all t G [0, 1] H Q, with t ^ vt Lipschitz in [0, 1] H Q. Passing 
to the limit as n — > cx) in the convexity inequalities relative to i^", and using the weak lower 
semicontinuity of F(-) and W|(-, •) we get 



(4.8) 



iw^ii^ufl) < (1 - t)Wi{uo,fl) + tW^{'^i,fl) - til - t)W^iuo,ui) 
[Fii^t) < {l-t)F{uQ)+tF{ui) 

for all t G [0, 1] HQ. By a density argument, based on the completeness of ^2{H), we can obtain 
a Lipschitz curve vt defined in the whole of [0, 1], still retaining the inequalities above. □ 

We prove now an important estimate which plays a key role in the sequel, see the proof of 
Theorem 15.11 below. 

[0, +00] he strongly displacement convex, let p, G 3^2{H) 



Proposition 4.4 Let F : 0^2{H) 
and let fij- be a minimizer of 



Then 



Wi{fir,u)-Wi{j2,ij) < 2T[F{i^)-F{f,r)], yv£D{F). 

Proof. Let vq = fi^, vi = v and consider the interpolating curve vt '■ [0, 1] ■ 
which (|4.ip holds. The minimality of fir and (j4.ip give 



(4.9) 

^2{H) along 



<(l-i) 



F{fir) + ^Wi{fJir.ll) 



+ t 



1 



F{v) + j-Wi{u,li) 



t(l_t)±VF|(/i,,i/) 



Subtracting F{iir) + (/^t; /^)/2t from the left hand side of the first inequality and from the 
right hand side of the second inequality, and dividing by f > we obtain: 



1 



F{u)-F{fir) > -W2'{f^r,fl) 



1 

27 



l-t 
2t 



> _[(l_t)VF|(^.,i.) - Wi{u,fl) 
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Letting t J, we have 

Wi{^ir,u)-Wi{ly,^l) + Wii^ir,^l) < 2t(f(i/) - (4.10) 
which yields (|4.9p by neglecting the nonnegative term [fi-j- ,p,). □ 

5 Existence and uniqueness of gradient flows 

In this section we prove existence and uniqueness of gradient flows and convergence of the 
approximations (j4.2p . Again the results of this section hold for more general classes of strongly 
displacement convex functionals, but we are only interested here in the case F{-) = 7i{-\-y), 
where we consider a fixed log-concave probability measure 7 on H. 

We go back to the sequence (/i^)fc defined recursively by (|4.2p with fi^ = JI ^ J^2{H), the 
existence of {n'^)k being granted by Proposition 14. 2i We shall denote the "discrete" semigroup 
induced by /x!^ by ^^fl{t), precisely 

S^rfiit) := 1-4'^^ Vt G {kT, (k + l)r]. (5.1) 

Theorem 5.1 (Existence and uniqueness of gradient flows) For any p, £ ^2{K) there 
exists a unique gradient flow starting from fi. The induced semigroup 5^Ji{t) satisfies 

W2{yfl{t),'^Hs)) < ^2F(/i)V|t-s|, t,s>0, fie ^2{K) (5.2) 
and the following properties: 

(i) (Uniform discrete approximation) W2{yfi{t),yrp'{t)) < C^/tF(P) if fl & D{F), 
with C = 2(2\/2 + 1); 

(ii) (Contractivity) W2{yii{t),,9'h'{t)) < W2{fi,u); 

(Hi) (Regularizing effect) F{.9'li{t)) < inf 5jVF|(/i,i/) + F(i/) < +00 for all t > Q, 
fl€DiF). 

Proof. We first sketch the proof of uniqueness of gradient fiows, referring to jSj Corollary 4.3.3] 
for all technical details: if fJ-^(t), fi'^{t) are gradient flows starting from p., setting u = fj.^{t) into 

~Wi{f^\t),u)<F{u)-F{f^\t)) 

and = /U^(t) into 

±^wi{^,\t),u)<F{u)-F{^,Ht)), 

one obtains that ■^W2 {fJ^^ (t) , fi^ {t)) < 0, whence the identity of fj,^ and fi^ follows. 
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In order to show existence of gradient flows, we consider first the case when /i G D{F). 
Notice that ^^'^^ satisfies 



F(/z^i) + l-WK^i'+\^Jl) < F{u) + G D{F) 

and, choosing in particular u = fj,^, we obtain that -F(/i^"^^) < F{n^) and 

W2{l4^\lJ;) < V2r[F(M^)-F(M^i)]. 
This inequality easily leads to the discrete C^/^ estimate 

Moreover a crucial role is played by the formula 



s + t\. 



(5.3) 

(5.4) 

(5.5) 
(5.6) 



W^{yrfi{ik + l)r), I/) - W|(^,/i(fcT), I/) < 2T[F{iy) - F{yrll{{k + l)r))] 

for all z> G D{F), which follows from Proposition I4.4[ 
Proof of (i). We start proving the estimate 

W^i^rHt), -^^m) - W^ifl, v) < 2tF{u) (5.7) 

for all r > and all times t that are integer multiples of r. To this aim, from (15. 6p we obtain 
the inequalities 



wl{5^^^u{i),e)-w:^{v,e) 



< T 



F{9)-F{yr_u{-)) 



Wi{yrv{r),e)-Wi{yrv{z)^e) <T F{e)-F{5^rv{r)) 



for all 6 G F){F), whose sum gives 



W2\^rHr),e) - Wi{u, e)<T 2F{9) - F{yr_y{-)) - F{yr_y{r)) 



(5.8) 
(5.9) 

(5.10) 



for ah e€ D{F). Still from ([SSD we get 

wi{yrii{T),e) -w^{fi,e) <2T[F{e) - F{yri2{T))] yeeD{F). (5.11) 

Setting 6 = J^t-/z(t) in (jS.lOp and ^ = in (jS.lip . we can add the resulting inequalities to obtain 
H^|(^,/i(r),^zz7(r))-l^|(^./z(0),^p(0)) < r (2F(z^) - F(^zz^(^)) - F(^zz7(r))) 

< 2TfF(z^)-F(^.z^(r))V (5.12) 



Notice that (|5.12p corresponds to (|5.7p with t = r; by adding the inequalities analogous to ()5.12p 
between consecutive times mr, (m + l)r, for m = 0, . . . , — 1, we obtain 



(5.13) 
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that yields (j5.7p because F is nonnegative. Now, from (j5.7p with fi = v we get 

for all t that are integer multiples of r/2™, so that 

T^2(^^/i(t),^^/i(t)) < 5]2-*/2y27FM (5.14) 

i=m 

for all n > m > j and all t that is an integer multiple of . For any such t (and therefore on 
a dense set of times) the sequence {yj_p,{t)) has the Cauchy property and converges in ^^2{H) 
to some limit, that we shall denote by J^(^)/u(i). 

Using the discrete C^/^ estimate (jS.Sp we obtain convergence for all times, as well as the 
uniform Holder continuity ()5.2p of 1 1— > oS^(T-)/i(t). 

We prove now that (oS^(T-)/i(t))t>o is a gradient flow starting from Jl. Indeed, we can read 
()5.6p as follows: 

^ (^rA(i), ^) < T j;[F(z.) - F(^./2(^r))]5, 

i=l 

for all V G D[F), in the sense of distributions. Passing to the limit as n — > co in the previous 
inequality with r replaced by t/2", the lower semicontinuity of F gives 

|^W^I(^{r)/i(i),^) < [i^(^) - i^(^M/i(t))] Vz. G I)(F) 

in the sense of distributions. This proves that =^(^)/2(t) is a gradient flow starting from /2, and 
since we proved that gradient flows are uniquely determined by the initial condition, from now 
on we shall denote ^fl{t) = ^(^^-^^{t). 

Proof of (i). Passing to the limit as n — > oo in ()5.14p . with m = j = 0, we obtain that 
W2{'5^p-{t),S^rJj'{t)) can be estimated with 2(\/2 + 1)a/tF(/2) when t/r is an integer. From 
(|5.5p . (|5.2p and the triangle inequality we obtain (i). 

Proof of (a) when fx G D{F). It suffices to pass to the limit as r | in (15. 7p . 
Proof of (in) when p, G D{F). By adding the inequalities 

Wi{S^rm + l)r), y) - Wi{^rfi{iT),u) < 2T[F{iy) - F^^rKii + l)r))] 

< 2t[F{u) - F{yrJi{NT))] 

for z = 0, . . . , — 1 we get 

W|(^^/Z(A^r),i/) - W|(/i,i/) < 2Nt[F{v) - F(^^/2(iVr))]. 

Replacing now r by r/2'^ in this inequality, and defining N as the integer part of 2'^t/r (so 
that Nt/2^ t), we can let m — > co to obtain (iii), neglecting the term VF|(oS^T-/i(A^r), u). 
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In order to prove (ii) and (iii) when p, £ D{F) we use a density argument. Indeed, let fin G D{F) 
be converging to p, £ D{F) in ^2{H): by (ii) we obtain that 5^Jin{t) is a Cauchy sequence for 
all t > 0, and therefore converges to some limit, that we shall denote by S^]l{t). It is not difficult 
to prove by approximation that yji{t) is a gradient flow, and it remains to show that it starts 
from p. We have indeed W2{y p-it) , ^ p.n{t)) < W2{pn,p), so that 



6 F-convergence and stability properties 

In this section we consider a sequence (7^) of log-concave probability measures on H weakly 
converging to 7 and a sequence of Hilbertian norms on Hn = H^{'yn) satisfying Assumption II .41 
We are going to prove that the gradient flows associated with 'H(-|7n) with respect to W2,h" 
converge to the gradient flow associated with TC{-\-y) with respect to W2, where the notation 
W2,H" has been introduced in (j2.2p . 

This result is natural in view of Theorem 15. 11 since the discrete approximating flow 
of 'H(-|7n) are defined only in terms of 7^ and W2,h^- However, the same result is much less 
obvious in view of the connection with the Fokker-Planck equation (13. 4p and the associated 
stochastic process {Xt)t>Q'- see Sections [7] and Section [H 

The main result of this section is the following: 

Theorem 6.1 (Stability of gradient flows) Suppose that (7^) C 0^{H) is a sequence of log- 
concave probability measures converging weakly to ^ £ ^{H) and that Assumption \1.4\ holds. 
Let /i" E ^2{^n) o,nd let ipt')t>o be the gradient flows associated with ^(-1771) in 3^2{^n) with 
respect to W2,H"- 

If pn converge to p £ ^2{A) in ^2{H) then /i" fit in I^2{H) for every t £ [0,-|-oo), where 
{fJ,t)t>o is the gradient flow associated with 7i{-\j) in ^2iA) with respect to W2- 

The crucial property in the proof of this stability result is the T-convergence of the functionals 
^(■|7n) to ^(-17). The concept of F-convergence is due to De Giorgi and is a classical tool of 
Calculus of Variations. 



Lemma 6.2 (Convergence of entropy functionals) If S ^{H) converge weakly to j £ 
^{H) then W(-|7„) : ^2{II) [Q,+oo] T-converge to W(-|7) : ^2{II) ^ [0,+oo], i.e. 

(i) for any sequence (/.f„) C ,0^2{II) converging weakly to fi £ ^2(11), we have 




Letting n ^ 00 we obtain that fi{t) fi as t [ Q. 



□ 



liminf W(/i„|7„) > W(/u|7); 



(6.1) 



(ii) for any fi £ ^2(11) there exist /i„ £ 3^2{II) converging to fi in 3^2{II) such that 



limsup H{lln\ln) < 'HifJ'h)- 



(6.2) 
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Proof. The "liminf" inequality (i) in the definition of F-convergence follows directly from the 
duality formula (j4.5p : if — > ^ weakly, for all bounded continuous 5 : — > M we have 



[ Sdfi- [ (e^-l)d7= lim [ S dfin - f 
Jh Jh [Jh Jh 



(e^ - 1) djn 



< liminf 7Y(|U„ 1 7n). 



IH 

Taking the supremum in the left hand side the liminf inequality is achieved. 

In order to show the lim sup inequality we first notice that, by diagonal arguments, we 
need only to show it for a dense subset ^ C D{7i{-\'y)); here density should be understood 
in the sense that for any i/ G D{7i{-\^)) there exist Un £ ^ converging to in 1^2{H) with 
'H{i'n\l) — *■ "^(z^It)- Let us check that 



:= e <^2{H) : / G a(^), / > 0, e > o} 



1 1 2 

has these properties: indeed, in this case, given ^ = gj ^ with g{x) = e~^"^"H f[x), we can 
simply take ^„ = Z'^gjn, with Z„ := f^jgdja, to achieve the lim sup inequality. The "density 
in energy" of ^ in the sense described above can be achieved as follows: first, using the density 
of Cb{H) in L^i'y) and the dominated convergence theorem, we see that any fi = pj £ ^2{H) 
with p G L°°{'j) can be approximated by elements of A truncation argument then gives that 
any p G D{7i{-\^)) can be approximated. □ 

In order to clarify the structure of the proof of Theorem 16.11 it is useful to introduce the 
following concept: we say that pn £ ^{Hn) converge with moments to p £ ^{H) if /i„ — > ^ 
weakly in ^{H) and WxWjj dpn — > f^j WxWjj dp. Notice that for any open set A C H we 
can use ()2.6p to obtain 

liminf / = liminf / ||7r„(x)|||^ > / WxWfj dp (6.3) 

J A J A J A 

whenever pn ^ p weakly in ^{H). Therefore, in the proof of convergence with moments, only 
the lim sup needs to be proved. 

Lemma 6.3 Convergence with moments is equivalent to convergence in 3^2{H)- 
Proof. If Pn p with moments, (|6.3p with A = < i?} gives 

R^oo n^oo J{\\x\\h>R} R^oo n^oo J{\\x\\h>R} 

< lim / ll^ll^ d/x = 0. 

^-^"^ J {\\x\\h>R} 

We obtain the convergence in ^2{H) from (12. 4p . Conversely, \i pn ^ p weakly, (j2.6p gives 

lim / ||vr„(x)|||^ A dpn = / \\x\\h A dp Vi? > 0. 

Jh„ " Jh 

If Pn p in J^2{H), we can use (j2.4p and ()1.9p to obtain limsup„ /r||„|| >ri \\x\\'jr dpn ^ as 

1 1 1 1 1 H fi — J 

R ^ oo, and if we combine this information with ()6.4p we obtain the convergence with moments. 

□ 



(6.4) 
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Lemma 6.4 Assume that f„ £ 3^{Hn), that S„ G r(/i„,f„) is converging to S G r(^,z^) 
weakly and that fin ^ fJ- with moments, while dv^ is bounded. Then 



lim / {x,y)jj^dT.n = / {x,y)jjdT.. 

Jh„xH„ JHxH 



Proof. We prove the liminf inequality only, the proof of the other one being similar. With the 
notation of (jl.lOp . we have {Trn{x),TTn{y)) — > {x, y)jj- as n — > oo for all x, y & H. For all e > 
the functions ^ 

are nonnegative, and these functions are equi-continuous in H x H hy (|1.9p . Therefore (|2.6p . 
thanks to the convergence assumption on gives 

liminf/ |||7r„(y)|||^^ + 2(7r„(x), 7r„(y))^ dS„ > / ^\\y\\jj + 2{x,y) dl2. 
JhxH ^ JhxH ^ 

Using the boundedness assumption on we can obtain the liminf inequality letting e | 0. 

□ 

In the proof of Theorem 16. II we need some continuity /lower semicontinuity properties of the 
Wasser stein distance. 

Lemma 6.5 Let /in, i^n ^ ^2{Hn) he such that ^ // E ^2{H), Vn ^ v ^ 1^2{H) weakly in 
3^{H). Then: 

( i) W2 < lim inf W2,Hn (^n , i^n ) ; 

n— >oo 

(ii) if Hn and i/„ ^ in ^2{H), then VF2,H„(/^n, t'n) ^ W2{fi, y). 

Proof. (i) Without loss of generality, we can assume that the liminf is a limit. Let S„ G 
^Hn,o{fJ-n, ^n)- Notice that tightness of and {vn) in H implies tightness of m H x H. 
Let S G r(;U, u) be a weak limit point of which obviously belongs to r(^, u). Then, taking 
into account the equi-continuity m. H x H oi the maps |Kn(2; — ?/)||_f/„i ensured by (jl.Op . by ()2.6p 
we get: 

W2{n,y) < / ||y - xll^dS < liminf / ||7rn(3; - 

= liminf/ ||y - = liminf W2^^(/i„, 

(ii) We choose optimal couplings S„ between and v^, relative to Hn, and prove that any 
weak limit S (which exists, possibly passing to subsequences) is optimal. The same truncation 
argument used in Lemma 16.31 to show that convergence in ^2 (H) implies convergence with 
moments shows that 

lim / \\y - x\\h dT.n = Wy-xWfjdT.. 



HxH 
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In order to prove the optimality of S we recall that S G r(A, v) is an optimal coupling (relative 
to the cost c{x,y) = \\x — y||^) if and only if for any £ € N, any (xj, yi)j=i^...^£ in the support of 
S and any permutation a of {1, ... ,i} the following inequality holds: 

I e 

i=l 1=1 

see for instance [3l Theorem 6.1.4]. Since S„ is optimal, a similar inequality holds with || • \\h^ 
instead of || • \\h for ah {x'^,y'^)i=i,...,e in the support of S^. Since Yj^ converge to S weakly, 
for any (xj, yj)j=i^...^^ in the support of S there exist {x"^ ,yf)i=i^___^£ in the support of S„ with 
(x", y") {xi, yi) in H X H. Then ()6.5p follows taking limits as ?z — > oo and using the fact that 
Zn £ Hn and ||z„ - z\\h implies ||2;n||H„ ^ Iklln- □ 

We can now prove Theorem 16. li With no loss of generality we can assume (possibly making 
translations) that ^(7n) = Hn. 

Proof. Set Fn{-) ■= Ti.{-\'yn)- We consider the case when Fnifi^) is bounded first. In this 
case, property (i) in Theorem 15.11 and (jl.Op ensure the uniform (in time, and with respect to 
n) estimate W|(//", =^"/i"(t)) < Cr. Here y^Jl^{t) is the discrete approximation (|5.ip of the 
gradient flow, obtained by the recursive minimization scheme (j4.3p : i.e. we define recursively 

n,0 -n n,k+l ■ ,i • ... r 

fir ■= fJ- , fJ'T IS the unique mimmizer of 

and we define S^^pP'it') := fir'''^^ for all t G [kr, (A;+ 1)t]. Therefore, taking also Lemma [6. 3 1 into 
account, in this case it suffices to show that, with r > fixed, the convergence with moments is 
preserved by the minimization scheme. So, let us assume that converge to fx with moments 
and Fn{fin) is bounded; we consider the minimizers f„ of the problems 

^2{Hn) 3 a ^ Fn{a) + ^WlHM,fin), 
and show that they converge with moments to the minimizer v of the problem 

^2(^) 3 cj ^ F{a) + ^Wi{a,fi). (6.6) 

It 

Notice first we can use A = ;U„ in the inequality 

Fni^n) + ^WlH^^n, f^n) < Fn{X) + ^WIhJX, fin) (6.7) 

to obtain that both '^^n and Fn{vn) are bounded. Since (7„) is tight and 'H{vn\^n) 

is bounded, then (f„) is tight as well, by the entropy inequality (|4.4p . Therefore (f„) has limit 
points with respect to the weak convergence. We will prove that any limit point is a minimizer 
of (j6.6p . so that it must be v. 

Let v' = limfc in the weak convergence, let A G 0^2{H) and let Afc be converging to A 
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in J^2{H), with limsup,^. F„(fc)(Afc) < F{X), whose existence is ensured by condition (ii) in the 
definition of F-convergence. Setting A = A^, n = n{k) in (j6.7p . and using also condition (i) in 
the definition of F-convergence to bound Fn(^f.'^{i'n{k)) from below, we get from (i) and (ii) of 
Lemma 16.51 



F{u') + —W^{iy',fi) < limsup 

< lim sup 

fc'— »oo 



Fn(k){^n{k)) + ^W^2^(i'n.(fc)i/^n.(fc)) 



< F{X) + ^Wi{X,u). (6.8) 

As A is arbitrary, this proves that z^' is a minimizer, therefore u' = u. 

Now, setting A = ;U in (16. 8p . we obtain that all inequalities must be equalities, so that 
limfc (/x„(^.), f„(fc)) = WfC/^i'^)- Indeed, if limsup^ (a^ + 6^) < a + 6, liminffcafc > a and 
liminffc hk > b, then limfc ak = a and lim^ bk = b. 

We shall denote in the sequel by optimal couplings between /i„ and Let S G F(/i, z^) 
a limit point in the weak convergence of and assume just for notational simplicity that the 
whole sequence S„ weakly converges to S. By (j2.6p we get 



/ ||x - yll^dS < liminf / \\x - yW^ dT^n = Wi{n,u), 

therefore S G Fo(;Li,i/). We can now apply Lemma [631 to obtain that {x,y) jj^dT^n 

J^^j^ {x,y) dT,; from the identity 

Wiifi,i^)= \\x\\Hdn+ - 2 / (j;,y)j^dE, 

and from the analogous one with the Hilbert spaces Hn we obtain that z/.„ converge with moments 
to i^. 

In the general case when Fn{p-n) is not bounded we can find, for any e > 0, G F>{F) with 
W2{fi, T^) < £■ By the definition of F-convergence we can also find v"^ converging to u in l3^2{H) 
with limsup„ F„(i>n) < F{D). For n large enough we still have W2,H„{p'"',i^"') < e> so that the 
contracting property of gradient fiows (see Theorem 15.11 (ii) ) gives 

H^2(=^A"(t), ^i>"(t)) + w2iJ^m,'^Ht)) 

< KW2,//„(^/i"(t),^z^"(t))+W2(^/i(t),^i>(t)) < (K + l)e, Vt>0. 
By applying the local uniform convergence property to z/" we get 

limsup sup W2(^A"(*),=^A(i)) < (« + l)e VT > 0. 

n-*oo te[0,T] 

□ 
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7 Wasserstein semigroup and Dirichlet forms 



In this section we establish a general link between the Wassertein semigroups and the semigroups 
arising from natural "gradient" Dirichlet forms, extending Proposition 13.21 to the general case of 
a log-concave measure 7 in H. We denote by K the support of 7 (a closed convex set, coinciding 
with {V < +00} when H = M}^ and 7 = exp(— y)^'^) and, without a real loss of generality, we 
consider the case when 

A(7) = H\^) = H. (7.1) 
We consider, recalling (|1.3p . the bilinear form 



E^{u,v) := / {Vu,Vv)d-i, u,veC^{H). (7.2) 
Jh 

Accordingly, we define the induced scalar product and norm on Cl{H): 

£^^i{u,v) := / uvdj + £^{u,v), \\u\\e-y.i ■= \J £-i,i{u,u). (7.3) 



H 

We start proving that 8^ is closable. We recall that closability means the following: for all 
sequences (n„) C C^{H) which are Cauchy with respect to || • ^ and such that n„ — > in -^^(7), 
we have j — > 0. This is equivalent to saying that the operator V : Cl{H) ^ L?'{'^\H) is 

closable in ^^(7). 

Lemma 7.1 (Closability) The bilinear form {£^,C^{H)) is closable in L'^('~f). 

Proof. Let us denote by Cyl{H) the subspace of C^{H) made by cylindrical functions; by a 
simple density argument we can assume that the sequence (n„) is contained in Cy\{H). We 
claim that closability follows by the lower semicontinuity of f 1— > £^{v,v) on Cyl{H): indeed, if 
this lower semicontinuity property holds, we can pass to the limit as m — > 00 in the inequality 
£ry{un — Um, Un — Um) < £, for n, m > n{e), to obtain £.y{un, u„) < e for n > n{e), i.e. ||ttn||£^ 1 — > 
0. 

So, let (vn) C Cyl{H) be converging in ^^(7) to w G Cyl{H) and let us prove that the 
inequality liminf„<5^(u„,?;„) > £^{v,v) holds. 

We show first that we can assume with no loss of generality that 7 S ^2iH), so that 
fj G ^2{H) for all bounded Borel functions /. Indeed, we can approximate 7 by the log- 
concave measures 7e := exp(— e||x|p)7/Z£ € ^2{H), where \ \ are normalization constants, 
and use the fact that 7^ < '^jZ^ and Z^'~^^ T 7 to obtain the lower semicontinuity of £^{v,v) 
from the lower semicontinuity of all £^^{v,v). The log-concavity of 7^ can be obtained by 
approximation: if (e^) is an orthonormal system in H, then all measures 

1 ^ 
7e,N ■■= ^ — exp(- V(x,ei)2)7 



are log-concave because their projections on any finite-dimensional subspace L D (ei, . . . ,e7v) 
have the form Z~lj exp(— e Yli i 
Proposition 12.21 can be applied. 



have the form Z^ exp(— e Yli i^^ ^«)^ ~ where exp(— F) is the density of [ttl)^^. Therefore 
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We can assume, possibly adding and multiplying by constants, that m := infw > and 
J v'^ dj = 1. By a simple truncation argument we can also assume that inf f„, > m/2, supf„ < 
supu + 1, and set Wn = Vn/\\vn\\2'i obviously — > w in -^^(7) and, as a consequence, u;^7 — > ^^7 
weakly. By Lemma lA.ll we get 

7^(/i|7) > n{wlj\j) - 2^£^{Wn,Wn)W2{fi,wlj) G ^2(^). 

The uniform upper bound on Wn ensures, taking (j2.4p into account, that w"^^ v"^^ in ^^2{H). 
Passing to the limit as n — > 00, the lower semicontinuity of the relative Entropy gives 

W(/i|7) > 'n{v^l\l) - 2 liminf j£^{wn, Wn) W2{fi, vS) V/x G ^2{H). 

n^oo V 

By applying Lemma lA. II again wg get liminf^ ^^(it?^, 

Wn) > S-yivjv), and from the definition of 
Wn we see that the same inequality holds if we replace Wn with Vn- D 

Being £-y closable, we shall denote by D{£^) its domain (i.e. the closure of Cl{H) with 
respect to the norm || • H^:^ J, which obviously can be identified with a subset of -^^^(7), and keep 
the notation £^ for the extension of £r^ to D{£^) x D[£^). In the next lemma we show that 
D{£^) contains Lip^(i^) and some useful representation formulas for the extension. 

Recall that a finite signed measure is a M- valued set function defined on Borel sets that can 
be written as the difference of two positive finite measures; by Hahn decomposition, any such 
measure can be uniquely written as /i = /i+ — /i~ , with nonnegative and /x"'' _L /i~ . The 
total variation is the finite measure defined by /i"*" + 

Lemma 7.2 (iS^ is a Dirichlet form) £^f is a Dirichlet form, Lip^(i^) C D{£^) and 

^j£'yiu,u) < ['u]Lip(K) Vn G Lipft(i^). (7.4) 
Moreover, the following properties hold: 

(i) if H is finite- dimensional, h & H and lh{x) = {h,x), there exists a finite signed measure 
S/j in H supported on K such that 

£j{u,ih)= / udY^h VuGLipfe(ir); (7.5) 
Jh 

(a) if TT : H ^ L is a finite- dimensional orthogonal projection, then 

£^{uo t:,v o tt) = £t^^^{u,v) Vu, v G Lip^(L). (7.6) 

Proof. Let u G Lip^(Er) and let u be a bounded Lipschitz extension of u to the whole of 
H. Combining finite-dimensional approximation and smoothing, we can easily find a sequence 
{un) C Cl{H) converging to u pointwise and with [un]wp{H) bounded. It follows that u„ — > n in 
L^(7) and, possibly extracting a subsequence, Vti„ U weakly in L?'{^\H). Then, a sequence 
{gn) of convex combinations of Un still converges to u in L'^{'^;H) and is Cauchy with respect 
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II ■ 11^7 1- It follows that u € D{8^). A similar argument proves (j7.4p and the fact that 
£y{(j){u), (j){u)) is less than £^{u,u) whenever u G D{£'y) and : R ^ M is 1-Lipschitz. This last 
property shows that £^ is a Dirichlet form. 

Now, let H = M'^', so that 7 = exp(— 1/)^^, and let us prove (i). By the closability of £j, we 
need only to prove that 

[ {Vu,h)dj= [ ud^h yueCliM.'') (7.7) 

for some finite signed measure S/j. The existence of such a measure S/^ (obvious in the case when 
W is Lipschitz, as an integration by parts gives S/j = —{W, h)'y) is ensured by Proposition ! A. 2[ 
Finally, notice that (j7.6p trivially holds by the definitions of E^y and Sn^'y when u G Cl{L), 
because o vr G C^{H). By approximation the equality extends to the case u, v £ Lip^(L). 

□ 

By the previous lemma, there exists a unique contraction semigroup Pt in -^^(7) associated 
to We are now going to compare it with the Wasserstein semigroup ^ ^{t) of Theorem 15.11 
and we shall denote in the sequel vf := 5^5x{t). 

Theorem 7.3 The semigroup Pt is regularizing from L°°(7) to Cb{K), and the identity 

Ptf{x) := / fduf, t>0, fe L°°(7), XGK (7.8) 

provides a continuous version of Ptf. In addition, Pt acts on Lip^(K).- 

[Ptf]up(K)<[f]upiK) t>0, V/GLip,(K). (7.9) 
Moreover, for any fj, G ^2iK), we have the identity 



^fi{t) = J iyfdfi{x), yt>0. (7.10) 

Proof. Assuming (17. 8p . let us first show why it provides a continuous version of Pt: if Xn — > x, 
and we denote by pn the densities of v^" with respect to 7, whose existence is ensured by the 
estimate Theorem lS-lT iii). the contracting property of the semigroup gives that /?„7 fry weakly, 
where p is the density of vf with respect to 7. On the other hand, the same estimate shows that 
'H{pnl\l) are uniformly bounded, therefore pn are equi-integrable in ^^(7) and weakly converge 
in -^^^(7) to p. This proves that the right hand side in (j7.8p is continuous. Finally, (j7.9p is a 
direct consequence of (|7.8p and Theorem 16. l( iii): indeed, choosing S G ro(t'f,ff), we get 



\Pf{x)-Ptf{y)\ 



fduf- / fduf 
H Jh 



{f{u)-f{v))dj:{u,v) 



H 



< [f]up{K) / \\U - V\\ dJ:{u, V) < [f]up(K)W2{l^f , vf) 

< [/]Lip(K)lk-2/||- 
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Step 1: the general finite-dimensional case H = M.^. It suffices to show that the 
class of convex V's for which the equivalence (j7.8p holds for the probability measure 7 = 
exp( — y) is closed under monotone convergence. 

Indeed, if V is smooth with W and all its derivatives bounded, then we know from Section [3] 
that the Wasserstein semigroup coincides with the FP semigroup; therefore from point 3 of 
Proposition 13.21 we obtain (|7.8p . and (j7.9p corresponds to (13.16p . 

By a convolution approximation, we extend the result to all convex Lipschitz functions V 
with f ex.p{—V) dx < 00; indeed, if pn is the density of AA(0,n~^/) with respect to where / 
is the identity matrix in M.^, then Vn '■= V * pn forms an increasing sequence by convexity of V. 
Eventually we obtain all convex functions V with J exp(— y) dx < 00 using the fact that they 
can be represented as the supremum (see for instance [8]) of countably many affine functions 
ii, and applying the equivalence to Vn ■= maxi<j<„£j (notice that for n large enough Vn has at 
least linear growth at infinity). 

So, let us consider a log-concave probability measure 7 = exp(— F)^^' and a sequence Vn T 
V, with Vn real-valued and Vi having at least a linear growth at infinity, such that the statement 
of the theorem holds for all measures 7„ = exp{—Vn) obviously the normalization 
constants Z„ converge to 1 and 7n — > 7 weakly. Notice also that supp7„, = M'^. 

We will also use the fact that both sides in (jZ.Sp are continuous with respect to 7-almost 
sure and dominated convergence, so we need only to check the identity when / £ Lip^(i('). We 
recall that, in general, the semigroup Ptf is related to the resolvent family Rxf by 



Rxfix) 
We define the bilinear form 



POO 

/ e-^'Ptf{x)dt x£K,feCb{K). (7.11) 
Jo 



S'^{u,v) := / {Vu,Vv)d-/n, u, v e Cl( 



Moreover, we denote by the resolvent family of £'^, again related to the semigroup P" on 
Cb{R^) relative to S"^ by exp{-Xt)P^ dt. Using the representation ([TSD of Pp, by Theo- 
rem [6TT] we know that, for all / G Ch(^^), R^f pointwise converge, on K, to the function Fxf 
defined by 



Fx fix) := / e-^* / f i^f dt x G K. 

Jo Jh 



We are going to show that Fxf coincides with Rxf the resolvent family of for all Lipj(J^), 
so that 

POO P 

Rxfix) = / e-^* / fdu^ dt xGK, fG Lip,(K). 

Jo JKfe 

Since, by the injectivity of the Laplace transform, (j7.1ip uniquely determines the semigroup Pt 
on Lip^(iC), (I7.8p would be achieved. 

So, let / £ Lipj,(if); possibly replacing / by a Lipschitz extension to the whole of M*^ with the 
same Lipschitz constant, we can assume that / E Lip^(]R'^) and [/]Lip(Rfc) = [f]hip{K) (indeed, 
neither Fxf nor Ptf depend on this extension). By applying ()7.9p to 7„ one obtains that 
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^[R\f]up{Rf') < [f]up{K), lience Fxf G Lipft(A') and Lemma O gives Fxf G D{£^^). Now, in 
order to prove that Fxf coincides with Rxf, by a density argument it is enough to show that 

a/ Fxfvdj + £^{Fxf,v)= [ fvdj V7;GC2(M'=). (7.12) 

Our strategy is to pass to the hmit as n ^ oo in: 

[ fvdjn = x[ RVvd^n + e^'iRV^v) V^;GC2(M'=). (7.13) 

Let (ei, . . . , Cfc) be the canonical basis of IR'^. By applying the integration by parts formula ()7.5p 
with h = Bi and u = (V-y, ei)R'^f, we get 



/t; d7n = / RVi^v - Av) djn + yZ [ RV {yv, e,) d^l 



(7.14) 



where S^. are associated to the measure 7^. The crucial fact is now the following: we can apply 
Lemma 17.41 to 

k k 

an ■■= {Xv - Av) djn + '^{'^v,ei) dT.2^, := {Xv - Av) dj + ^{Vv,ei) dT,^^, 

1=1 i=l 

with ipn{x) := R^flx), ipoo{x) = Fxf{x) and associated to the measure 7. Indeed, assump- 
tions (i) and (ii) of the Lemma 17.41 are guaranteed by Proposition IA.3I in the Appendix, while 
(iii) and (iv) hold trivially. Therefore, by (|7.14p we have: 

k 

fvdj = lim [ fvdjn= [ Fxf {Xv - Av) dj + f Fxf {Vv,e,) d^^^. 

1=1 

Again, by the integration by parts formula (|7.5|) shows that the last expression is equal to the 
right hand side in (j7.12p . This proves that Fxf = Rxf on K for all / G Lip^(/^). 

Notice now that (|7.1Up holds for smooth V by (j3.3p and Proposition 13.41 By approximation, 
using the stability result of Theorem 16. H we obtain the general case. 

Step 2: from the finite-dimensional to the infinite-dimensional case. We fix a complete 
orthonormal system {ej}.j>i in H and we set Hn ■= spanjei, . . . , e„}, denoting as usual by 
TTn : H ^ Hn the finite-dimensional projections. Setting 7^ = (7r„)^7, it is immediate to check 
that 7n is log-concave in Hn and that H^i^jn) = Hn (if not, we would get that H^{'^) is contained 
in a proper subspace of H, contradicting (j7.ip ). We set: 

£:"(u,z;) := / {Vu^Vv) d-fn u, v G Cl{Hn). 

J Hn 

We should rather write Vh^u for u G Cl{Hn), but since the scalar product of Hn is induced 
by H there is no ambiguity in writing Vu : Hn ^ Hn- We denote by (-Ra)a>Oi Pt (respectively 
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(i2")A>05 Pt) the resolvent family and the semigroup of (resp. <5"). We also know, by the 
previous step, that R^f is representable on Ch{Kn) by ex.p{—Xt) f du^'^ dt: here Kn 
denotes the support of 7„ and u^'^ the associated Wasserstein semigroup in ^^2{Kn)- Since 
7n = (7rn)#7, we have iTn{x) E Kn for all x £ K. As a consequence, by Theorem 16.11 we obtain 
that 

lim i?'^/(7r„(2;)) = lim [ e"^* /" f du^'^^'^'^Ut = [ e"^* /" /di/fdt V x G 

for all / € Cb{H). We shall denote, as in Step 1, by Fxf the right-hand side. Our strategy 
will be, again, to show that R\f{x) = exp{—Xt)Fxf{x) dt. We assume first that f = g o iTk 
is cylindrical function, with g S Lipj(//fc); by applying (|7.9p to 7„ one obtains that \[{R^f) o 
T^nlupiK) < [f]up{K„) < [9]up{u^), hencc Fxf G Uph{K) and Lemma ED gives Fa/ G D{£^). 

Now, let u = n o vr^ and n G C^(Hi); for n > max{/i;,/}, taking into account (j7.6p and the 
identities / = / o 7r„, t; = u o 7r„, we have 

[ vfdj= f vfdjn = a / t;ii^/d7„ + f"Ki?^/) 

= X [ v{Rlf) o 7r„ d7 + £^{v, {R^J) o (7.15) 

Now, {Rxf) ° converge to Fxf in -^^(7) and is bounded with respect to the norm || • H^;^ ^ , by 
the uniform Lipschitz bound. Therefore, by the closability of {Rxf) °T^n ^ F\f in the weak 
topology of D{£). Thus, we can passing to the limit as n — > 00 in ()7.15p to obtain 

/ vfdj = X [ vFxf dj + £^{v, Fxf) yv = uoTri, ue Cl{Hi). 

JH JH 

By the L'^i'y) density of cylindrical functions v, we obtain Rxf{x) = Fxf for all Lipschitz and 
bounded cylindrical functions /. As a consequence, (j7.8p holds for this class of functions. Since 
both sides in (j7.8p are continuous with respect to 7-almost sure and dominated convergence, 
again a density argument shows that the equality (|7.8p extends to all / G L°° (7) . 

By the previous step, we know that (j7.10p holds for the finite-dimensional case. By approxi- 
mation, using the stability result of Theorem 16. 11 the contractivity of gradient flows of Theorem 
15.11 and Lemma 17.41 below, we obtain the general case. □ 

In the proof of Theorem 17.31 we also used the following result. 
Lemma 7.4 Let an, (Too be finite signed measures on H and let ipn, foe : H satisfy: 
(i) snpnWn\{H) < +00 and 

lim / If dan = / fdaoo G Cb{H); 

n^ooj J 

(a) there exist compacts sets Jm C H such that sup„ |o"„| {H \ Jm) ^ as m ^ 00; 
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(in) {^Pn}neNu{oo} equi-bounded and equi- continuous; 
(iv) ipn converge pointwise to 99 oo on suppcToo- 
Then lim„ Jj^ fn dan = Jh^oo da 00 ■ 

Proof. Without loss of generality we can assume that lim„ J"^ ipn dan exists (so that we can freely 
extract subsequences) and < 1, \(poo\ < 1- Let us fix m and assume, possibly extracting a 
subsequence, that ipn ipm uniformly on Jm as n — > 00; obviously (foo = 

m on Jm ^ supp a 00 ■ 

We extend ipm continuously to the whole of H with \ipm\ ^ 1- Then: 




The first term in the right hand side can be estimated, splitting the integration on Jm and on 
H \ Jm, with supj^ \ipn — i^m\\an\{H) + 2\an\{H \ Jm). The second term tends to as n — > cx) 
by our first assumption, while the third one can be estimated with 2|croo|(-ff \ Jm)- Therefore, 
taking first the limsup as n — > 00 and then letting m ^ 00 we have the thesis. □ 



Remark 7.5 (Continuity of Pt) The =^^2(-ff)-continuity of 1 1-^ y5x{t) in [0, +oo[ shows that 
Ptf f pointwise in as t | for all functions / G C{K) with at most quadratic growth at 
infinity, and in particular for / G Cb{K). Taking ()7.9I) into account, the convergence is uniform 
on compact subsets of -fT if / G hipfj{K); by density, Ptf — > / uniformly on compacts sets as 
t I for all / G UCb{K), the space of bounded uniformly continuous functions on H. It is 
also possible to show the regularizing effect Pt{Cb{K)) C UCb{K) for t > 0: indeed, the finite- 
dimensional smooth systems are easily seen to be Strong Feller (see section 7.1 of |13j). and this 
property extends to the general case by approximation. 



8 The Markov process 

In this section we complete the proofs of Theorem 11.21 Theorem 11.31 Theorem 11.51 and Theo- 
rem II. 6|, proving the existence of a unique Markov family of probability measures on 
satisfying 

Ptf{x)=E,,[f{Xt)] VxG/f (8.1) 

for all bounded Borel functions /. The continuity of P^,' will be a consequence of the regularizing 
properties of the Wasserstein semigroup y^it) (in particular the continuity of x ,y5x{t) 
will play an important role). The regularity property (|8.5p . instead, is based on general results 
from [23], that provide a Markov family satisfying a weaker property, and on the continuity of 
1 1— > S^5x{t). As in the previous section we will use the notation uf for y5x{t). 
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Recall also that the regularizing estimate (iii) in Theorem 15.11 give ^ 7 for all x ^ K; 
by the uniform bound on the relative Entropy (which yields equi-integrability of the densities), 
Dunford-Pettis theorem provides the continuity property 

Xn&K, \\xn-x\\^^, /9„7 = i/f", pj = i'f =^ /9n ^ P Weakly in (7) (8.2) 

for ah t > 0. 

Proof of Theorem 11.21 We already proved statement (a) in Lemma |7.2[ Let us consider the 
semigroup Pt induced by S-y, linked to ff by (I7.8p : the semigroup property of Pf can be read 
at the level of I'f , and gives the Chapman-Kolmogorov equations. Therefore these measures are 
the transition probabilities of a time-homogeneous Markov process P^; in 

^[o,+oo[ In particular, 

the Markov property gives the explicit formula 

F,{{Xt, £Ai}n---n {Xt^_, G An^i} n {Xt„ g A^}) (8.3) 

= [ ■■■[ I l<"-^-.(yn)<r^„_,(yn-l)---^Z.r;_,„(yi) 

(with yo = X, = to < ti < ■ ■ • < tn < +00, Ai, . . . ,An £ ^^{K)) for these finite-dimensional 
distributions. The continuity of x i-^ P^;, namely the continuity of all finite-dimensional distri- 
butions, is a direct consequence of (j8.2p and (|8.3p . 

Now, let us prove (j8.5p . In order to apply the general results of [23], we need to emphasize two 
more properties of First, is tight: this means that there exists a nondecreasing sequence 
of compact sets Fm C H such that cap^(-?/ \ Fm) ( cap^ being the capacity induced by 

see |23j). This can be proved using (17. 4p and the argument in [231 Proposition IV. 4. 2]: let 
{xn) C H he a dense sequence and define 

Wn(x) := min < 1, min \\x — xAl > . 

0<i<n J 

It is immediate to check that < w;„ < 1, w;„ | in and [tL'n]Lip{H) ^ 1- Therefore (wn) is 
bounded in the weak topology of D{£^) and converges to in the weak topology of The 
Banach-Saks theorem ensures the existence of a subsequence (n^) such that the Cesaro means 

_ Wni H h Wnfe 

k 

converge to strongly in D{£^). This implies \2,2>\ Proposition III. 3. 5] that a subsequence {vk{i)) 
of {vk) converges to quasi- uniformly, i.e. for all integers m > 1 there exists a closed set Gm C H 
such that cap^(-?/ \ Gm) < 1/m and — > uniformly on Gm- As tL'n{fc{0)) — 
Fm. = Ui<mGi, we have that Wn(k{i)) ~^ uniformly on Fm for all m and cap^(-ff \ Fm) < 1/m. 
If e > and n is an integer such that Wn < e on Fm, the definition of Wn implies 

n 

Fm C \jB{xi,e). 

i=l 
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Since e is arbitrary this proves that Fm is totahy bounded, hence compact. This completes the 
proof of the tightness of £^ . 

Second, S-y is local, i.e. £.y{u,v) = whenever u, v £ D{£^) have compact and disjoint 
support. This can be easily achieved (see also [23l Lemma V.1.3]) taking sequences (tin), {vn) C 
Cl{A{j)) converging to u and v respectively in the norm || • and modifying them, without 
affecting the convergence, so that and Vn have disjoint supports. One concludes noticing that 
S^{f,g) = whenever f,g£ C^{A{'y)) have disjoint supports. 

These properties imply, according to |23l Theorem IV. 3. 5, Theorem V.1.5] the existence of a 
Markov family of probability measures {Px}xeK on C{[0, +oo[; K) (uniquely determined up to 
7-negligible sets), satisfying 

Ptf{x)=E,Af{Xt)] for 7-a.e. X G (8.4) 

for all bounded Borel functions f on K (here E^; is the expectation with respect to P^)- Now, 
since H is separable we can find a countable family A of open sets stable under finite intersections 
which generates ^{H); choosing / = lyi in ()8.4p and ()7.8p . and taking into account that A is 
countable, we can find a 7-negligible set N C K such that i^f{A) = K^llAiXt)] for all A £ A, 
t £ Q and all x G \ A^. As a consequence, Z/'* is the law of Xf under P^, for all x £ K\N and 
all t £ Q. We can now use the continuity of the process Xt and of 1 1—> i^f to obtain that i/f are 
the one-time marginals of P^; for all x £ K \ N. 

We prove now path continuity under for x £ K \ N, using the property i^f <^ 7: we adapt 
the approach of [H] to our setting. By the Markov property we obtain that all finite-dimensional 
distributions of P^ and P^; coincide; as a consequence, if we denote hy i : n ^ K^o,+oo[ ^j^^ 
(obviously measurable) injection map, i^P^ = P^- By the Ulam lemma, we can find compacts 
sets KnC n with Fx{Kn) T 1; now, if 5 C [0, +00) is bounded, countable and Bs C K^'+^I is 
the measurable set defined by 

Bs := {uj £ il. : the restriction of w to S" is uniformly continuous} , 

from the inclusion Bs H D Kn we obtain ¥x{Bs) > ¥x{Kn), hence ¥x{Bs) = 1. A well known 
criterion [30\ Lemma 2.1.2] then gives that = 1. This proves that 

P*(J]) = 1 for 7-a.e. x£ K. (8.5) 

In order to show the first part of statement (c), fix x £ K, and define Bs as above, with 
5" c]0, +oo[ satisfying e := inf5 > and sup 5 < 00. Since the law of ^(.(x) is absolutely 
continuous with respect to 7, we know from ()8.5p that ¥x^{Bs~e) = 1 Pa;-almost surely. Taking 
expectations, and using the Markov property, we get ¥x{Bs) = 1- Again the same argument in 
[3(71 Lemma 2.1.2] shows that P* (C(]0, +oo[; i?)) = 1. 

Finally, we use the representation ()7.8p and the fact that Pt is selfadjoint (due to the fact 
that £-y is symmetric) to obtain 

/ l\{x)i:{y)duy{x)d^{y)= [ [ ^{x)i^{y)duf{y)dj{x) i; £ L^{j). (8.6) 

JK JK JK JK 

This means that the process P^ is reversible. 

Proof of Theorem 11.31 It is a direct consequence of the estimates in Theorem 15.11 and of the 

coincidence, proved above, of the law of Xt under P^; with . □ 
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Proof of Theorem 11.51 We shall denote by f"'^, (resp. z^f , E-j) the transition probabilities 
and the expectations relative to (resp. Px)- From Theorem 16. II we obtain: 

Xn^Kn, \\xn-x\\^0, xeK =^ z^"'''" ^ z^f in ^2(^)- (8.7) 

We shall prove by induction on m that E"^[/(Xj^, . . . — > Ej:[/(Xj^ , . . . , Xt^)] for all 
/ G CbiH"^). Obviously we can restrict ourselves to / G Lip^(i^™') and the case m = 1 
corresponds to (j8.7p . So, let us assume the statement valid for m > 1 and let us prove it for 
m + 1. Let / € Lip{,(i/"^"'"^), and let n„ : H — > Kn be the canonical projection. By the weak 
convergence of 7„ to 7, we have ||n„(y) — y|| — > for all y G K. As a consequence, the induction 
assumption gives E^^^^^lf {y , Xt^ , . . • ^ Ey[f{y,Xt^,. . . ,Xt„+J]. Since / is Lipschitz 

we have also 

Jim Ej\^(^)[/(n„(y),Xt,, . . . = E,[/(2/,Xt„ . . .,Xt^^,)] Vy G K. (8.8) 

Thanks to (j8.8p and Lemma 17.41 we can pass to the limit as n — > 00 in the identity 
E^[/(Xi,,...,Xi„^J] = [ E^[/(y,Xi„...,X,„^J]dz.,7"(y) 

= 1^ Eii^^(^) [/(n„(y), X,, , . . . , Xt^^, )] dz.;;'^" (y) 

to obtain ES[/(Xt, , . . . , Xt„,^, )] ^ E, [f{y, X*,, . . . , X^^^, )] ciz.,- (y) = E, [/(X^, , . . . , Xt^^, )] . 
This proves statement (a). Statements (b) and (c) follow at once by the tightness Lemma |8. II 
below. □ 

Lemma 8.1 (Tightness) Let 7„ and 7 as in Theorem \1.5l let x G -^'(7) and let x„ G K('yn) 
be such that x„ — > x. For all < e < T < +00, h G H, the laws of {{Xt,h)H,t G [e,r]) under 
P"^, n G N, form a tight sequence in C([e,T]). Moreover the laws of {{Xt,h)H,t G [0, T]) under 
P"^, n G N, form a tight sequence in C([0,T]). 

Proof Let = H^{ln), Kn = K{jn) and P!;^ = /^^ P^ c?7n(2;). For any h e Hn we have by 
the Lyons-Zheng decomposition, see e.g. [Ml Th. 5.7.1] that, under P"^, 

{h, Xt - Xo)h^ = 1 Mi - i (iVT - NT-t), Vt G [0, T], 

where M, respectively A^, is a P"^ -martingale with respect to the natural filtration of (Xt, t G 
[0, T]), respectively of (Xt-i, t G [0,T]). Moreover, the quadratic variations (M)j, {N)t are 
both equal to t • . By the Burkholder-Davis-Gundy inequality we can find, for all p > 1, 

a constant Cp G (0, +00) such that 

El[\{h,Xt-X,)^J]<Cp\\hrHjt-s\^/', t,.G[0,r]. (8.9) 

Let us denote by n„ : H — > Hn the duality map satisfying {Iln{h),v)H„ = {h, v)h for all h G Hn- 
Then, choosing v = Iln{h)i from (|1.9p we get ||n.n(/i)||//„ < so that (jS.Op gives 

E" [\{h,Xt - X,)^r] < - s\'P'\ t, s G [0,T]. 
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Then tightness of the laws of (X*, h) h under P!^^^ in C([0, T]) follows e.g. by [301 Exercise 2.4.2]. 

Let e > and let us prove that sup„ 'H(;/^''^"|7„) < +oo. Since x ^ K there exist > 
such that 7(i?/?^(x)) > 1/2, so that there exists such that ^n{BR^{xn)) > 1/2 for all n > n^. 
Let Un := 7„(-|Sij^(x,i)); then for n > we have ^^n) < k^Wj^I^^x^, z^n) < '2.k'^B?, and 

since 'H(^'n,|7n) = — ln7n(i?/jj by (iii) of Theorem 15.11 we get 

" |7n) < + In 2 Vn > n,. 

Let Pn (resp. Pn) be the law of {Xt,t G [e, T]) under P^^ (resp. P!^^). Let us prove that 
<. Pn and 

dP — 

= p^{Xe) P„-almost surely, (8.10) 

where p" is the density of f"'^" with respect to 7^. For any bounded and Borel functional 
^ : C([0, r — e]]H) 1-^ M, we have by the Markov property: 

= E^ip^iXo) HX.)) = Klip^iX,) HX,+ .)), 

where in the last equality we use stationarity, and (|8.10p is proven. 

Let now h £ H and let Pjj (resp. P^) be the law of {{Xt,h)H,t G [s,T]) under P"^ (resp. 
P"^); since the relative Entropy does not increase under marginals [21 9.4.5], from (|8.10|) we get 

?^(P:^|P;) < n(Pn\Pn) = nu^"hn). 

It follows that sup„ 'H(PjjlP^) is finite. By applying the entropy inequality (|4.4p . tightness of 
(Pn) implies tightness of (Pjj). □ 

Proof of Theorem 11.61 Let p G I3^2{H) be an invariant measure for {Pt)t>o- Then, by (jT.lOp . 
S^p{t) = p is a constant gradient flow of ^(-17) and therefore, by (|3.20p . Ti.{p\j) < 7^(1^17) for 
all u G ^2{H). Since t ^ tint is strictly convex, the unique minimizer of "^(-17) in 3^2{H) is 
7, and therefore /i = 7- □ 



A Some properties of log-concave measures 

In this appendix we state and prove some useful properties of log-concave measures and of convex 
functions used throughout the paper. 

First of all, for lower semicontinuous convex functions 1/ : R*"' — > M U {+00} (i.e. the 
typical densities of log-concave measures), we recall that the properties / exp(— y)dx < -|-oo, 
V{x) — > -|-oo as ||x|| — > -|-cxD and V{x) +00 at least linearly as ||x|| +00 are all equivalent: 
indeed, the equivalence between the second and the third one simply follows by the monotonicity 
of difference quotients along radial directions, and clearly a linear growth at infinity implies 
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finiteness of the integral. On the other hand, if the integral is finite, a crude growth estimate on 
V can be obtained as follows: assuming with no loss of generality that {1^ < +00} has nonempty 
interior, we can find a ball B and M < +00 such that V < M on B; then, on the convex cone 
Cx generated by x and B, we have the inequality V < M + V^{x). Changing signs and taking 
exponentials we can integrate on Cx to obtain 

^V+{x)+M yff ^-V(y) ^fc(C^) ^ as \x\ ^ +00. 

Lemma A.l (Variational characterization of £^{u,u)) Let'j S 0^2{H) he a non- degenerate 
log-concave measure, and let u he a bounded cylindrical function, with inf u > and J dj = 
1. Then y^£^{u, u) is the smallest constant S satisfying 

n{fih)>n{u'^jh)-2SW2ifi,u^^), yfi£^2iH). (a.i) 

Proof. First, we realize that this is essentially a finite-dimensional statement. Indeed, if 
TT : H ^ L is a finite-dimensional orthogonal projection such that u = uott, a simple application 
of Jensen's inequality gives [3l Lemma 9.4.5] 7Y(//|7) > 'H(vr#/i|7r^7). Since PV2(7r#/i, tt^7r#7) = 
W2(7r^//, vr^(n^7)) < W2{^i,u'^'y) and £-y{u,u) = £-,^^^{u,u), we need only to check the analog 
of (IA.1|) with 7 replaced by 7r#7 and H replaced by L. So, from now on we shall assume that 
H = M^' for some integer k. 

In Lemma 13.31 we proved that 



n{^Ji\^) > n{u^^h) - 2^£^{u, u) W2{fi, A), e ^2(m'=) 

when 7 = ex.p{—V)^^ with V smooth, convex, and W and all its derivatives are bounded (it 
suffices to use the Schwartz inequality to estimate from below the scalar product in (j3.18p ). By 
monotone approximation (see Step 2 in the proof of Theorem 17. 3p the same inequality holds for 
all log-concave 7 in M'^. 

It remains to show that y^£j{u, u) is the smallest constant with this property. In order to 
prove this fact, we fix s G C^(M^';M'^) with support contained in the interior of {V < +00}, 
and consider the maps := i + es and the measures /i^ = (te)#(ii^7), so that W^2^(//e, m^7) < 
J ||s|pM^ d'j. On the other hand, the area formula gives that the density of fj.£ with respect to 
is given by fg, where 

_ exp(-y) 1 
|detVt,| ' 

(notice that for £ small enough is a diffeomorphism which leaves {V < +00} invariant). Since 

?^ (/i,|7) = / f,lnf,dx+ [ f,V dx= [ ln(/, o t,y dj+[ {V o t,)u^ dj 

= n{u^-f\-f)-e {V ■s)u^exp{-V)- {VV,s)u^exp{-V)dx + o{e) 

= W(ti%|7) + 2e/ u{Vu,s)d'y + o{e) 
Jr* 
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from (jA.ip we get 

/ u{Vu,s)dj>-S\l I 115112^2^7. 




Since U27 is concentrated in the interior of {V < +00}, we can approximate in L2(n^7;M*'') the 
function —Vlnn with s it follows that S > y^Sylu, u). □ 

In the next two propositions, borrowed essentially from [36], we show that, for convex func- 
tions [/ : M*^ — > M, the growth at infinity of \/U is always balanced by the factor e~^; this leads 
to uniform bounds and tightness estimates for the measures \VU\e~^ , under uniform lower 
bounds on U. 

Proposition A. 2 Let U : ^ M U {+00} be convex and lower semicontinuous, with U{x) — > 
+00 as \\x\\ +00, {U < +00} having a nonempty interior, and set 7 = exp(— C/)^'^. Then, 
for all unit vectors /i G R'^ there exists a unique finite signed measure in M.^ supported on 
{U < +00} such that 

Jj^k dh 

Moreover, we have \Tj^\{M.^) = 2 exp(— min + t/i)) dy. 

Proof. Assume first k = 1; the function t 1— > exp{—U{t)) is infinitesimal at infinity, non- 
decreasing on a half-line (— oo,to) and non-increasing on (tg, +00), where to is any point in the 
interior of {U < +00} where U attains its minimum value. Then exp(— ?7) has bounded variation 
on R and the total variation of its distributional derivative ^ exp(— [/) is representable by: 



^dj= I udJ^h yuGCliR''). (A.2) 



dt 



1 ^.-^1 ^ .-u 



It follows that 1^ exp(— [/)|(R) = 2 exp(— J7(io)) = 2 exp(— min [/); by definition of distribu- 

tional derivative, = — ^exp(— f/) fulfils ()A.2p when the function u is compactly supported, 
and a simple density argument gives the general case. 

In the case > 1 we denote Uy{t) := U{y + th); since U has at least linear growth at infinity, 
it is easy to check that exp(— nnnJ/y) is integrable on . Now, notice that Fubini's theorem 

implies the existence of and its coincidence with the measure J^x dy, i.e. 



^^e-^dydt = j^^ (^l^±u{y + th)e-^y('Ut^ dy, 



u G ch 



On the other hand, if we denote by A the projection on of the interior of the convex set 
{U < +00}, and by C the projection of {U < +cxo}, we have that {Uy < +00} has nonempty 
interior for all y G j4, while Uy is identically equal to -|-oo for all y G \ C; points y in 
C \ A correspond to projections of boundary points of {U < +00} where h is tangential to 
the boundary, and the co-area formula gives that this set of points is .if'^"^ -negligible in h-^. 
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As a consequence, = 2 exp(— min ^7j^) for ^-a.e. y e . A general result [21 

Corollary 2.29] allows to commute total variation and integral, so that 

1/ J:^y dylCM.^) = I \T.^y\(R)dy = 2 I exp(- mmU(y + th)) dy. 

Jh^ Jh^ Jh^ 



□ 



Proposition A. 3 (Continuity and tightness) Let V„ : M'^ ^ ]RU{+oo} be convex and lower 
semicontinuous function, with Vn ] V and J exp(— Vi) < +oo. Then for all unit vectors h 
there exist compact sets Jm. C M'^ such that: 

\J:1"\{R'' \ Jm) < — V71, m>l. (A.3) 



v„ 

h ' 



Furthermore, S^" — > T,^ in the duality with Cb{R''). 

Proof Let A G M, S > be such that Vi{x) > A + B\\x\\ for all x E M'^. We set := S 
S := Sjf . We first notice that Vn{y + th) >Vi{y + th)>A + B\\y\\ for all y £ h^. Therefore, 
taking into account the representation of |S^|(IR'^) given by the previous proposition, we obtain 
that IS^KR*^) is uniformly bounded. On the other hand, since exp i-Vn)^'' weakly converge to 
exp(— y)^*^ (by the dominated convergence theorem) from ()A.2p we infer that S weakly 

in the duality with (M^), and then in the duality with Cc(M'^). 
We will prove that 

lim IS^KM'^) = \mR^). (A.4) 



n— >oo 



Before proving ()A.4p . we show that it implies ()A.3P : consider a dense sequence {xj) in M*^ and 



set, for p, I > 1 integers, := Uj^]^S(xj, 1/p). It is enough to prove that for all p there exists 



1 

I = l{p) such |S"|(]R'^ \ A^) < 2^P/m for all n: indeed, in this case Jm '■= l^p^f(p) is a compact 
set such that |S"|(]R^ \ J„i) < 1/m for all n > 1. If, for some p, we can not find such I, then 
for ah I there exists n{l) such that |S"(')|(Af) < ISJ^^'^KM*^) - 2~P/m. Since n{l) must tend to 
+00 as / — >— > oo, and any open ball Br{0) is contained in Af for I large enough, by the lower 
semicontinuity of the total variation on open sets (see for instance [21 Proposition 1.62(b)]) we 
find: 

|S|(5^(0)) < liminf |S"(')|(B^(0)) < liminf ) < \J:\{R'') - 2~p /m. 

Letting r | oo we obtain a contradiction. Therefore ()A.3P is proven. 

In order to prove (1A.4|) . taking again into account the representation of jS^KM'^) given by 
the previous proposition and the dominated convergence theorem, it suffices to show that, with 
y G /i^ fixed, min V^(y + th) converges as n — > oo to m.mV{y + th). By monotonicity we need 

only to show that 

liminf minl/'„(y + t/i) > m.mV{y + th). (A. 5) 

Let n{k) be a subsequence along which the liminf is achieved, let be minimizers of t i-^- 
^n{k){y + if^)i a-iid assume (possibly extracting one more subsequence) that tk — > t. The lower 
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semicontinuity of T 4(p) g ives linifc Vn{^k){y + tkh) > liminffc K{p)(y + *fc^) > yn{p){y + th). Letting 
p — > cx) we obtain (|A.5p . Finally, the tightness estimate allows to pass from convergence of S*^ 
in the duality with Cci^^) to the convergence in the duality with Cb(M*'). □ 
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